:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Yiran, López, José Antonio Hernández, Nilsson, Ulf, Varró, Dániel
Format:	Preprint
Published:	2026
Subjects:	Software Engineering
Online Access:	https://arxiv.org/abs/2602.18537
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks
by: Wang, Yiran, et al.
Published: (2025)

Why do Machine Learning Notebooks Crash? An Empirical Study on Public Python Jupyter Notebooks
by: Wang, Yiran, et al.
Published: (2024)

Generative AI in Simulation-Based Test Environments for Large-Scale Cyber-Physical Systems: An Industrial Study
by: Sadrnezhaad, Masoud, et al.
Published: (2025)

The Power of Types: Exploring the Impact of Type Checking on Neural Bug Detection in Dynamically Typed Languages
by: Chen, Boqi, et al.
Published: (2024)

ALPINE: An adaptive language-agnostic pruning method for language models for code
by: Saad, Mootez, et al.
Published: (2024)

On Inter-dataset Code Duplication and Data Leakage in Large Language Models
by: López, José Antonio Hernández, et al.
Published: (2024)

Hierarchical Evaluation of Software Design Capabilities of Large Language Models of Code
by: Saad, Mootez, et al.
Published: (2025)

SENAI: Towards Software Engineering Native Generative Artificial Intelligence
by: Saad, Mootez, et al.
Published: (2025)

SHERPA: A Model-Driven Framework for Large Language Model Execution
by: Chen, Boqi, et al.
Published: (2025)

Concretization of Abstract Traffic Scene Specifications Using Metaheuristic Search
by: Babikian, Aren A., et al.
Published: (2023)

A Flexible Cell Classification for ML Projects in Jupyter Notebooks
by: Perez, Miguel, et al.
Published: (2024)

A Study of Using Multimodal LLMs for Non-Crash Functional Bug Detection in Android Apps
by: Ju, Bangyan, et al.
Published: (2024)

Finding the Needle in the Crash Stack: Industrial-Scale Crash Root Cause Localization with AutoCrashFL
by: Kang, Sungmin, et al.
Published: (2025)

LLM-based Satisfiability Checking of String Requirements by Consistent Data and Checker Generation
by: Chen, Boqi, et al.
Published: (2025)

Refining Fuzzed Crashing Inputs for Better Fault Diagnosis
by: Kim, Kieun, et al.
Published: (2025)

CrashJS: A NodeJS Benchmark for Automated Crash Reproduction
by: Oliver, Philip, et al.
Published: (2024)

Better Debugging: Combining Static Analysis and LLMs for Explainable Crashing Fault Localization
by: Yan, Jiwei, et al.
Published: (2024)

Logging Like Humans for LLMs: Rethinking Logging via Execution and Runtime Feedback
by: Wang, Xin, et al.
Published: (2026)

Crash-free Deductive Verifiers
by: Nauta, Wander, et al.
Published: (2026)

A Study of Scientific Computational Notebook Quality
by: Kashiwa, Shun, et al.
Published: (2026)

Method Names in Jupyter Notebooks: An Exploratory Study
by: Wong, Carol, et al.
Published: (2025)

Static Analysis Driven Enhancements for Comprehension in Machine Learning Notebooks
by: Venkatesh, Ashwin Prasad Shivarpatna, et al.
Published: (2023)

Are the Majority of Public Computational Notebooks Pathologically Non-Executable?
by: Nguyen, Tien, et al.
Published: (2025)

Understanding Feedback Mechanisms in Machine Learning Jupyter Notebooks
by: Shome, Arumoy, et al.
Published: (2024)

Mining the Characteristics of Jupyter Notebooks in Data Science Projects
by: Choetkiertikul, Morakot, et al.
Published: (2023)

Containing the Reproducibility Gap: Automated Repository-Level Containerization for Scholarly Jupyter Notebooks
by: Samuel, Sheeba, et al.
Published: (2026)

LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
by: Zhang, Xiangrui, et al.
Published: (2025)

CaveAgent: Transforming LLMs into Stateful Runtime Operators
by: Ran, Maohao, et al.
Published: (2026)

ThrowBench: Benchmarking LLMs by Predicting Runtime Exceptions
by: Prenner, Julian Aron, et al.
Published: (2025)

Beyond Crash-to-Patch: Patch Evolution for Linux Kernel Repair
by: Bai, Luyao, et al.
Published: (2026)

Integrating Code Metrics into Automated Documentation Generation for Computational Notebooks
by: Ghahfarokhi, Mojtaba Mostafavi, et al.
Published: (2026)

Predicting the Impact of Crashes Across Release Channels
by: Mujahid, Suhaib, et al.
Published: (2024)

Hidden Gems in the Rough: Computational Notebooks as an Uncharted Oasis for IDEs
by: Titov, Sergey, et al.
Published: (2024)

Typhon: Automatic Recommendation of Relevant Code Cells in Jupyter Notebooks
by: Ragkhitwetsagul, Chaiyong, et al.
Published: (2024)

Automated Modernization of Machine Learning Engineering Notebooks for Reproducibility
by: Jin, Bihui, et al.
Published: (2026)

Observing Fine-Grained Changes in Jupyter Notebooks During Development Time
by: Titov, Sergey, et al.
Published: (2025)

Human to Document, AI to Code: Comparing GenAI for Notebook Competitions
by: Settewong, Tasha, et al.
Published: (2025)

Similarity-Based Assessment of Computational Reproducibility in Jupyter Notebooks
by: Hossain, A S M Shahadat, et al.
Published: (2025)

GPTrace: Effective Crash Deduplication Using LLM Embeddings
by: Herter, Patrick, et al.
Published: (2025)

Towards Effective Detection of Ponzi schemes on Ethereum with Contract Runtime Behavior Graph
by: Liang, Ruichao, et al.
Published: (2024)