Saved in:
| Main Authors: | Wang, Yiran, López, José Antonio Hernández, Nilsson, Ulf, Varró, Dániel |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.18537 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks
by: Wang, Yiran, et al.
Published: (2025)
by: Wang, Yiran, et al.
Published: (2025)
Why do Machine Learning Notebooks Crash? An Empirical Study on Public Python Jupyter Notebooks
by: Wang, Yiran, et al.
Published: (2024)
by: Wang, Yiran, et al.
Published: (2024)
Generative AI in Simulation-Based Test Environments for Large-Scale Cyber-Physical Systems: An Industrial Study
by: Sadrnezhaad, Masoud, et al.
Published: (2025)
by: Sadrnezhaad, Masoud, et al.
Published: (2025)
The Power of Types: Exploring the Impact of Type Checking on Neural Bug Detection in Dynamically Typed Languages
by: Chen, Boqi, et al.
Published: (2024)
by: Chen, Boqi, et al.
Published: (2024)
ALPINE: An adaptive language-agnostic pruning method for language models for code
by: Saad, Mootez, et al.
Published: (2024)
by: Saad, Mootez, et al.
Published: (2024)
On Inter-dataset Code Duplication and Data Leakage in Large Language Models
by: López, José Antonio Hernández, et al.
Published: (2024)
by: López, José Antonio Hernández, et al.
Published: (2024)
Hierarchical Evaluation of Software Design Capabilities of Large Language Models of Code
by: Saad, Mootez, et al.
Published: (2025)
by: Saad, Mootez, et al.
Published: (2025)
SENAI: Towards Software Engineering Native Generative Artificial Intelligence
by: Saad, Mootez, et al.
Published: (2025)
by: Saad, Mootez, et al.
Published: (2025)
SHERPA: A Model-Driven Framework for Large Language Model Execution
by: Chen, Boqi, et al.
Published: (2025)
by: Chen, Boqi, et al.
Published: (2025)
Concretization of Abstract Traffic Scene Specifications Using Metaheuristic Search
by: Babikian, Aren A., et al.
Published: (2023)
by: Babikian, Aren A., et al.
Published: (2023)
A Flexible Cell Classification for ML Projects in Jupyter Notebooks
by: Perez, Miguel, et al.
Published: (2024)
by: Perez, Miguel, et al.
Published: (2024)
A Study of Using Multimodal LLMs for Non-Crash Functional Bug Detection in Android Apps
by: Ju, Bangyan, et al.
Published: (2024)
by: Ju, Bangyan, et al.
Published: (2024)
Finding the Needle in the Crash Stack: Industrial-Scale Crash Root Cause Localization with AutoCrashFL
by: Kang, Sungmin, et al.
Published: (2025)
by: Kang, Sungmin, et al.
Published: (2025)
LLM-based Satisfiability Checking of String Requirements by Consistent Data and Checker Generation
by: Chen, Boqi, et al.
Published: (2025)
by: Chen, Boqi, et al.
Published: (2025)
Refining Fuzzed Crashing Inputs for Better Fault Diagnosis
by: Kim, Kieun, et al.
Published: (2025)
by: Kim, Kieun, et al.
Published: (2025)
CrashJS: A NodeJS Benchmark for Automated Crash Reproduction
by: Oliver, Philip, et al.
Published: (2024)
by: Oliver, Philip, et al.
Published: (2024)
Better Debugging: Combining Static Analysis and LLMs for Explainable Crashing Fault Localization
by: Yan, Jiwei, et al.
Published: (2024)
by: Yan, Jiwei, et al.
Published: (2024)
Logging Like Humans for LLMs: Rethinking Logging via Execution and Runtime Feedback
by: Wang, Xin, et al.
Published: (2026)
by: Wang, Xin, et al.
Published: (2026)
Crash-free Deductive Verifiers
by: Nauta, Wander, et al.
Published: (2026)
by: Nauta, Wander, et al.
Published: (2026)
A Study of Scientific Computational Notebook Quality
by: Kashiwa, Shun, et al.
Published: (2026)
by: Kashiwa, Shun, et al.
Published: (2026)
Method Names in Jupyter Notebooks: An Exploratory Study
by: Wong, Carol, et al.
Published: (2025)
by: Wong, Carol, et al.
Published: (2025)
Static Analysis Driven Enhancements for Comprehension in Machine Learning Notebooks
by: Venkatesh, Ashwin Prasad Shivarpatna, et al.
Published: (2023)
by: Venkatesh, Ashwin Prasad Shivarpatna, et al.
Published: (2023)
Are the Majority of Public Computational Notebooks Pathologically Non-Executable?
by: Nguyen, Tien, et al.
Published: (2025)
by: Nguyen, Tien, et al.
Published: (2025)
Understanding Feedback Mechanisms in Machine Learning Jupyter Notebooks
by: Shome, Arumoy, et al.
Published: (2024)
by: Shome, Arumoy, et al.
Published: (2024)
Mining the Characteristics of Jupyter Notebooks in Data Science Projects
by: Choetkiertikul, Morakot, et al.
Published: (2023)
by: Choetkiertikul, Morakot, et al.
Published: (2023)
Containing the Reproducibility Gap: Automated Repository-Level Containerization for Scholarly Jupyter Notebooks
by: Samuel, Sheeba, et al.
Published: (2026)
by: Samuel, Sheeba, et al.
Published: (2026)
LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
by: Zhang, Xiangrui, et al.
Published: (2025)
by: Zhang, Xiangrui, et al.
Published: (2025)
CaveAgent: Transforming LLMs into Stateful Runtime Operators
by: Ran, Maohao, et al.
Published: (2026)
by: Ran, Maohao, et al.
Published: (2026)
ThrowBench: Benchmarking LLMs by Predicting Runtime Exceptions
by: Prenner, Julian Aron, et al.
Published: (2025)
by: Prenner, Julian Aron, et al.
Published: (2025)
Beyond Crash-to-Patch: Patch Evolution for Linux Kernel Repair
by: Bai, Luyao, et al.
Published: (2026)
by: Bai, Luyao, et al.
Published: (2026)
Integrating Code Metrics into Automated Documentation Generation for Computational Notebooks
by: Ghahfarokhi, Mojtaba Mostafavi, et al.
Published: (2026)
by: Ghahfarokhi, Mojtaba Mostafavi, et al.
Published: (2026)
Predicting the Impact of Crashes Across Release Channels
by: Mujahid, Suhaib, et al.
Published: (2024)
by: Mujahid, Suhaib, et al.
Published: (2024)
Hidden Gems in the Rough: Computational Notebooks as an Uncharted Oasis for IDEs
by: Titov, Sergey, et al.
Published: (2024)
by: Titov, Sergey, et al.
Published: (2024)
Typhon: Automatic Recommendation of Relevant Code Cells in Jupyter Notebooks
by: Ragkhitwetsagul, Chaiyong, et al.
Published: (2024)
by: Ragkhitwetsagul, Chaiyong, et al.
Published: (2024)
Automated Modernization of Machine Learning Engineering Notebooks for Reproducibility
by: Jin, Bihui, et al.
Published: (2026)
by: Jin, Bihui, et al.
Published: (2026)
Observing Fine-Grained Changes in Jupyter Notebooks During Development Time
by: Titov, Sergey, et al.
Published: (2025)
by: Titov, Sergey, et al.
Published: (2025)
Human to Document, AI to Code: Comparing GenAI for Notebook Competitions
by: Settewong, Tasha, et al.
Published: (2025)
by: Settewong, Tasha, et al.
Published: (2025)
Similarity-Based Assessment of Computational Reproducibility in Jupyter Notebooks
by: Hossain, A S M Shahadat, et al.
Published: (2025)
by: Hossain, A S M Shahadat, et al.
Published: (2025)
GPTrace: Effective Crash Deduplication Using LLM Embeddings
by: Herter, Patrick, et al.
Published: (2025)
by: Herter, Patrick, et al.
Published: (2025)
Towards Effective Detection of Ponzi schemes on Ethereum with Contract Runtime Behavior Graph
by: Liang, Ruichao, et al.
Published: (2024)
by: Liang, Ruichao, et al.
Published: (2024)
Similar Items
-
JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks
by: Wang, Yiran, et al.
Published: (2025) -
Why do Machine Learning Notebooks Crash? An Empirical Study on Public Python Jupyter Notebooks
by: Wang, Yiran, et al.
Published: (2024) -
Generative AI in Simulation-Based Test Environments for Large-Scale Cyber-Physical Systems: An Industrial Study
by: Sadrnezhaad, Masoud, et al.
Published: (2025) -
The Power of Types: Exploring the Impact of Type Checking on Neural Bug Detection in Dynamically Typed Languages
by: Chen, Boqi, et al.
Published: (2024) -
ALPINE: An adaptive language-agnostic pruning method for language models for code
by: Saad, Mootez, et al.
Published: (2024)