:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Torun, Utku Boran, Demircan, Mehmet Taha, Gön, Mahmut Furkan, Tüzün, Eray
Format:	Preprint
Published:	2025
Subjects:	Software Engineering Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.08005
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Understanding the Limits of Automated Evaluation for Code Review Bots in Practice
by: Karakaya, Veli, et al.
Published: (2026)

Automated Root-Cause Subclassification and No-Code Fix Generation for Invalid Bug Reports
by: Gon, Mahmut Furkan, et al.
Published: (2026)

ImproBR: Bug Report Improver Using LLMs
by: Akyol, Emre Furkan, et al.
Published: (2026)

Evaluation of LLM-Based Software Engineering Tools: Practices, Challenges, and Future Directions
by: Torun, Utku Boran, et al.
Published: (2026)

Agents in the Sandbox: End-to-End Crash Bug Reproduction for Minecraft
by: Yapağcı, Eray, et al.
Published: (2025)

Towards Automated Detection of Inline Code Comment Smells
by: Oztas, Ipek, et al.
Published: (2025)

Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence
by: Geruslu, Vehid, et al.
Published: (2026)

Rethinking Code Review in the Age of AI: A Vision for Agentic Code Review
by: Kamalı, Hüseyin Özgür, et al.
Published: (2026)

Evaluating Large Language Models for Code Review
by: Cihan, Umut, et al.
Published: (2025)

Past, Present and Future: Exploring Adaptive AI in Software Development Bots
by: Elsisi, Omar, et al.
Published: (2025)

The Future of Generative AI in Software Engineering: A Vision from Industry and Academia in the European GENIUS Project
by: Gröpler, Robin, et al.
Published: (2025)

Future of Code with Generative AI: Transparency and Safety in the Era of AI Generated Software
by: Hanson, David
Published: (2025)

A Survey of Bugs in AI-Generated Code
by: Gao, Ruofan, et al.
Published: (2025)

Evaluating the Impact of Data Cleaning on the Quality of Generated Pull Request Descriptions
by: Tire, Kutay, et al.
Published: (2025)

Large Language Models to Enhance Business Process Modeling: Past, Present, and Future Trends
by: Bettencourt, João, et al.
Published: (2026)

BugBlitz-AI: An Intelligent QA Assistant
by: Yao, Yi, et al.
Published: (2024)

PR-Aware Automated Unit Test Generation: Challenges and Opportunities
by: Haratian, Vahid, et al.
Published: (2026)

Bug Analysis Towards Bug Resolution Time Prediction
by: Ozkan, Hasan Yagiz, et al.
Published: (2024)

MarsCode Agent: AI-native Automated Bug Fixing
by: Liu, Yizhou, et al.
Published: (2024)

BugSpotter: Automated Generation of Code Debugging Exercises
by: Pădurean, Victor-Alexandru, et al.
Published: (2024)

Can We Enhance Bug Report Quality Using LLMs?: An Empirical Study of LLM-Based Bug Report Generation
by: Acharya, Jagrit, et al.
Published: (2025)

Automated Duplicate Bug Report Detection in Large Open Bug Repositories
by: Laney, Clare E., et al.
Published: (2025)

One Bug, Hundreds Behind: LLMs for Large-Scale Bug Discovery
by: Wu, Qiushi, et al.
Published: (2025)

Automated Classification of Human Code Review Comments with Large Language Models
by: Çağlar, Semih, et al.
Published: (2026)

Bugs in Large Language Models Generated Code: An Empirical Study
by: Tambon, Florian, et al.
Published: (2024)

BugPilot: Complex Bug Generation for Efficient Learning of SWE Skills
by: Sonwane, Atharv, et al.
Published: (2025)

Automated Code Review In Practice
by: Cihan, Umut, et al.
Published: (2024)

Software Reuse in the Generative AI Era: From Cargo Cult Towards AI Native Software Engineering
by: Mikkonen, Tommi, et al.
Published: (2025)

Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries
by: Nirujan, Hinduja, et al.
Published: (2026)

PyResBugs: A Dataset of Residual Python Bugs for Natural Language-Driven Fault Injection
by: Cotroneo, Domenico, et al.
Published: (2025)

Can GPT-O1 Kill All Bugs? An Evaluation of GPT-Family LLMs on QuixBugs
by: Hu, Haichuan, et al.
Published: (2024)

AEGIS: An Agent-based Framework for General Bug Reproduction from Issue Descriptions
by: Wang, Xinchen, et al.
Published: (2024)

Benchmarking Mythos-Linked Bug Rediscovery
by: David, Isaac, et al.
Published: (2026)

RLocator: Reinforcement Learning for Bug Localization
by: Chakraborty, Partha, et al.
Published: (2023)

RefExpo: Unveiling Software Project Structures through Advanced Dependency Graph Extraction
by: Haratian, Vahid, et al.
Published: (2024)

SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs
by: Pham, Minh V. T., et al.
Published: (2025)

The Future of Software Testing: AI-Powered Test Case Generation and Validation
by: Baqar, Mohammad, et al.
Published: (2024)

Evaluating LLM-Based Test Generation Under Software Evolution
by: Haroon, Sabaat, et al.
Published: (2026)

BLAgent: Agentic RAG for File-Level Bug Localization
by: Mamun, Md Afif Al, et al.
Published: (2026)

Self-Bootstrapping Automated Program Repair: Using LLMs to Generate and Evaluate Synthetic Training Data for Bug Repair
by: de-Fitero-Dominguez, David, et al.
Published: (2025)