Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Jajal, Purvish, Jiang, Wenxin, Tewari, Arav, Kocinare, Erik, Woo, Joseph, Sarraf, Anusha, Lu, Yung-Hsiang, Thiruvathukal, George K., Davis, James C.
Format:	Preprint
Published:	2023
Subjects:	Software Engineering Machine Learning
Online Access:	https://arxiv.org/abs/2303.17708
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909302730522624
author	Jajal, Purvish Jiang, Wenxin Tewari, Arav Kocinare, Erik Woo, Joseph Sarraf, Anusha Lu, Yung-Hsiang Thiruvathukal, George K. Davis, James C.
author_facet	Jajal, Purvish Jiang, Wenxin Tewari, Arav Kocinare, Erik Woo, Joseph Sarraf, Anusha Lu, Yung-Hsiang Thiruvathukal, George K. Davis, James C.
contents	Software engineers develop, fine-tune, and deploy deep learning (DL) models using a variety of development frameworks and runtime environments. DL model converters move models between frameworks and to runtime environments. Conversion errors compromise model quality and disrupt deployment. However, the failure characteristics of DL model converters are unknown, adding risk when using DL interoperability technologies. This paper analyzes failures in DL model converters. We survey software engineers about DL interoperability tools, use cases, and pain points (N=92). Then, we characterize failures in model converters associated with the main interoperability tool, ONNX (N=200 issues in PyTorch and TensorFlow). Finally, we formulate and test two hypotheses about structural causes for the failures we studied. We find that the node conversion stage of a model converter accounts for ~75% of the defects and 33% of reported failure are related to semantically incorrect models. The cause of semantically incorrect models is elusive, but models with behaviour inconsistencies share operator sequences. Our results motivate future research on making DL interoperability software simpler to maintain, extend, and validate. Research into behavioural tolerances and architectural coverage metrics could be fruitful.
format	Preprint
id	arxiv_https___arxiv_org_abs_2303_17708
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Analysis of Failures and Risks in Deep Learning Model Converters: A Case Study in the ONNX Ecosystem Jajal, Purvish Jiang, Wenxin Tewari, Arav Kocinare, Erik Woo, Joseph Sarraf, Anusha Lu, Yung-Hsiang Thiruvathukal, George K. Davis, James C. Software Engineering Machine Learning Software engineers develop, fine-tune, and deploy deep learning (DL) models using a variety of development frameworks and runtime environments. DL model converters move models between frameworks and to runtime environments. Conversion errors compromise model quality and disrupt deployment. However, the failure characteristics of DL model converters are unknown, adding risk when using DL interoperability technologies. This paper analyzes failures in DL model converters. We survey software engineers about DL interoperability tools, use cases, and pain points (N=92). Then, we characterize failures in model converters associated with the main interoperability tool, ONNX (N=200 issues in PyTorch and TensorFlow). Finally, we formulate and test two hypotheses about structural causes for the failures we studied. We find that the node conversion stage of a model converter accounts for ~75% of the defects and 33% of reported failure are related to semantically incorrect models. The cause of semantically incorrect models is elusive, but models with behaviour inconsistencies share operator sequences. Our results motivate future research on making DL interoperability software simpler to maintain, extend, and validate. Research into behavioural tolerances and architectural coverage metrics could be fruitful.
title	Analysis of Failures and Risks in Deep Learning Model Converters: A Case Study in the ONNX Ecosystem
topic	Software Engineering Machine Learning
url	https://arxiv.org/abs/2303.17708

Similar Items