Saved in:
Bibliographic Details
Main Authors: Widyasari, Ratnadira, Sim, Sheng Qin, Lok, Camellia, Qi, Haodi, Phan, Jack, Tay, Qijin, Tan, Constance, Wee, Fiona, Tan, Jodie Ethelda, Yieh, Yuheng, Goh, Brian, Thung, Ferdian, Kang, Hong Jin, Hoang, Thong, Lo, David, Ouh, Eng Lieh
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.15481
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910309968510976
author Widyasari, Ratnadira
Sim, Sheng Qin
Lok, Camellia
Qi, Haodi
Phan, Jack
Tay, Qijin
Tan, Constance
Wee, Fiona
Tan, Jodie Ethelda
Yieh, Yuheng
Goh, Brian
Thung, Ferdian
Kang, Hong Jin
Hoang, Thong
Lo, David
Ouh, Eng Lieh
author_facet Widyasari, Ratnadira
Sim, Sheng Qin
Lok, Camellia
Qi, Haodi
Phan, Jack
Tay, Qijin
Tan, Constance
Wee, Fiona
Tan, Jodie Ethelda
Yieh, Yuheng
Goh, Brian
Thung, Ferdian
Kang, Hong Jin
Hoang, Thong
Lo, David
Ouh, Eng Lieh
contents The 2019 edition of Stack Overflow developer survey highlights that, for the first time, Python outperformed Java in terms of popularity. The gap between Python and Java further widened in the 2020 edition of the survey. Unfortunately, despite the rapid increase in Python's popularity, there are not many testing and debugging tools that are designed for Python. This is in stark contrast with the abundance of testing and debugging tools for Java. Thus, there is a need to push research on tools that can help Python developers. One factor that contributed to the rapid growth of Java testing and debugging tools is the availability of benchmarks. A popular benchmark is the Defects4J benchmark; its initial version contained 357 real bugs from 5 real-world Java programs. Each bug comes with a test suite that can expose the bug. Defects4J has been used by hundreds of testing and debugging studies and has helped to push the frontier of research in these directions. In this project, inspired by Defects4J, we create another benchmark database and tool that contain 493 real bugs from 17 real-world Python programs. We hope our benchmark can help catalyze future work on testing and debugging tools that work on Python programs.
format Preprint
id arxiv_https___arxiv_org_abs_2401_15481
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle BugsInPy: A Database of Existing Bugs in Python Programs to Enable Controlled Testing and Debugging Studies
Widyasari, Ratnadira
Sim, Sheng Qin
Lok, Camellia
Qi, Haodi
Phan, Jack
Tay, Qijin
Tan, Constance
Wee, Fiona
Tan, Jodie Ethelda
Yieh, Yuheng
Goh, Brian
Thung, Ferdian
Kang, Hong Jin
Hoang, Thong
Lo, David
Ouh, Eng Lieh
Software Engineering
The 2019 edition of Stack Overflow developer survey highlights that, for the first time, Python outperformed Java in terms of popularity. The gap between Python and Java further widened in the 2020 edition of the survey. Unfortunately, despite the rapid increase in Python's popularity, there are not many testing and debugging tools that are designed for Python. This is in stark contrast with the abundance of testing and debugging tools for Java. Thus, there is a need to push research on tools that can help Python developers. One factor that contributed to the rapid growth of Java testing and debugging tools is the availability of benchmarks. A popular benchmark is the Defects4J benchmark; its initial version contained 357 real bugs from 5 real-world Java programs. Each bug comes with a test suite that can expose the bug. Defects4J has been used by hundreds of testing and debugging studies and has helped to push the frontier of research in these directions. In this project, inspired by Defects4J, we create another benchmark database and tool that contain 493 real bugs from 17 real-world Python programs. We hope our benchmark can help catalyze future work on testing and debugging tools that work on Python programs.
title BugsInPy: A Database of Existing Bugs in Python Programs to Enable Controlled Testing and Debugging Studies
topic Software Engineering
url https://arxiv.org/abs/2401.15481