محفوظ في:
التفاصيل البيبلوغرافية
المؤلفون الرئيسيون: Jahan, Sigma, Shah, Mehil B., Rahman, Mohammad Masudur
التنسيق: Preprint
منشور في: 2024
الموضوعات:
الوصول للمادة أونلاين:https://arxiv.org/abs/2402.01021
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1866915404856688640
author Jahan, Sigma
Shah, Mehil B.
Rahman, Mohammad Masudur
author_facet Jahan, Sigma
Shah, Mehil B.
Rahman, Mohammad Masudur
contents Software bugs cost the global economy billions of dollars annually and claim ~50\% of the programming time from software developers. Locating these bugs is crucial for their resolution but challenging. It is even more challenging in deep-learning systems due to their black-box nature. Bugs in these systems are also hidden not only in the code but also in the models and training data, which might make traditional debugging methods less effective. In this article, we conduct a large-scale empirical study to better understand the challenges of localizing bugs in deep-learning systems. First, we determine the bug localization performance of four existing techniques using 2,365 bugs from deep-learning systems and 2,913 from traditional software. We found these techniques significantly underperform in localizing deep-learning system bugs. Second, we evaluate how different bug types in deep learning systems impact bug localization. We found that the effectiveness of localization techniques varies with bug type due to their unique challenges. For example, tensor bugs were more accessible to locate due to their structural nature, while all techniques struggled with GPU bugs due to their external dependencies. Third, we investigate the impact of bugs' extrinsic nature on localization in deep-learning systems. We found that deep learning bugs are often extrinsic and thus connected to artifacts other than source code (e.g., GPU, training data), contributing to the poor performance of existing localization methods.
format Preprint
id arxiv_https___arxiv_org_abs_2402_01021
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Towards Understanding the Challenges of Bug Localization in Deep Learning Systems
Jahan, Sigma
Shah, Mehil B.
Rahman, Mohammad Masudur
Software Engineering
Software bugs cost the global economy billions of dollars annually and claim ~50\% of the programming time from software developers. Locating these bugs is crucial for their resolution but challenging. It is even more challenging in deep-learning systems due to their black-box nature. Bugs in these systems are also hidden not only in the code but also in the models and training data, which might make traditional debugging methods less effective. In this article, we conduct a large-scale empirical study to better understand the challenges of localizing bugs in deep-learning systems. First, we determine the bug localization performance of four existing techniques using 2,365 bugs from deep-learning systems and 2,913 from traditional software. We found these techniques significantly underperform in localizing deep-learning system bugs. Second, we evaluate how different bug types in deep learning systems impact bug localization. We found that the effectiveness of localization techniques varies with bug type due to their unique challenges. For example, tensor bugs were more accessible to locate due to their structural nature, while all techniques struggled with GPU bugs due to their external dependencies. Third, we investigate the impact of bugs' extrinsic nature on localization in deep-learning systems. We found that deep learning bugs are often extrinsic and thus connected to artifacts other than source code (e.g., GPU, training data), contributing to the poor performance of existing localization methods.
title Towards Understanding the Challenges of Bug Localization in Deep Learning Systems
topic Software Engineering
url https://arxiv.org/abs/2402.01021