| Main Authors: | Torun, Utku Boran, Demircan, Mehmet Taha, Gön, Mahmut Furkan, Tüzün, Eray |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.08005 |
Similar Items
Understanding the Limits of Automated Evaluation for Code Review Bots in Practice
by: Karakaya, Veli, et al.
Published: (2026)
Automated Root-Cause Subclassification and No-Code Fix Generation for Invalid Bug Reports
by: Gon, Mahmut Furkan, et al.
Published: (2026)
ImproBR: Bug Report Improver Using LLMs
by: Akyol, Emre Furkan, et al.
Published: (2026)
Evaluation of LLM-Based Software Engineering Tools: Practices, Challenges, and Future Directions
by: Torun, Utku Boran, et al.
Published: (2026)
Agents in the Sandbox: End-to-End Crash Bug Reproduction for Minecraft
by: Yapağcı, Eray, et al.
Published: (2025)
Towards Automated Detection of Inline Code Comment Smells
by: Oztas, Ipek, et al.
Published: (2025)
Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence
by: Geruslu, Vehid, et al.
Published: (2026)
Rethinking Code Review in the Age of AI: A Vision for Agentic Code Review
by: Kamalı, Hüseyin Özgür, et al.
Published: (2026)
Evaluating Large Language Models for Code Review
by: Cihan, Umut, et al.
Published: (2025)
Past, Present and Future: Exploring Adaptive AI in Software Development Bots
by: Elsisi, Omar, et al.
Published: (2025)
The Future of Generative AI in Software Engineering: A Vision from Industry and Academia in the European GENIUS Project
by: Gröpler, Robin, et al.
Published: (2025)
Future of Code with Generative AI: Transparency and Safety in the Era of AI Generated Software
by: Hanson, David
Published: (2025)
A Survey of Bugs in AI-Generated Code
by: Gao, Ruofan, et al.
Published: (2025)
Evaluating the Impact of Data Cleaning on the Quality of Generated Pull Request Descriptions
by: Tire, Kutay, et al.
Published: (2025)
Large Language Models to Enhance Business Process Modeling: Past, Present, and Future Trends
by: Bettencourt, João, et al.
Published: (2026)
BugBlitz-AI: An Intelligent QA Assistant
by: Yao, Yi, et al.
Published: (2024)
PR-Aware Automated Unit Test Generation: Challenges and Opportunities
by: Haratian, Vahid, et al.
Published: (2026)
Bug Analysis Towards Bug Resolution Time Prediction
by: Ozkan, Hasan Yagiz, et al.
Published: (2024)
MarsCode Agent: AI-native Automated Bug Fixing
by: Liu, Yizhou, et al.
Published: (2024)
BugSpotter: Automated Generation of Code Debugging Exercises
by: Pădurean, Victor-Alexandru, et al.
Published: (2024)
Can We Enhance Bug Report Quality Using LLMs?: An Empirical Study of LLM-Based Bug Report Generation
by: Acharya, Jagrit, et al.
Published: (2025)
Automated Duplicate Bug Report Detection in Large Open Bug Repositories
by: Laney, Clare E., et al.
Published: (2025)
One Bug, Hundreds Behind: LLMs for Large-Scale Bug Discovery
by: Wu, Qiushi, et al.
Published: (2025)
Automated Classification of Human Code Review Comments with Large Language Models
by: Çağlar, Semih, et al.
Published: (2026)
Bugs in Large Language Models Generated Code: An Empirical Study
by: Tambon, Florian, et al.
Published: (2024)
BugPilot: Complex Bug Generation for Efficient Learning of SWE Skills
by: Sonwane, Atharv, et al.
Published: (2025)
Automated Code Review In Practice
by: Cihan, Umut, et al.
Published: (2024)
Software Reuse in the Generative AI Era: From Cargo Cult Towards AI Native Software Engineering
by: Mikkonen, Tommi, et al.
Published: (2025)
Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries
by: Nirujan, Hinduja, et al.
Published: (2026)
PyResBugs: A Dataset of Residual Python Bugs for Natural Language-Driven Fault Injection
by: Cotroneo, Domenico, et al.
Published: (2025)
Can GPT-O1 Kill All Bugs? An Evaluation of GPT-Family LLMs on QuixBugs
by: Hu, Haichuan, et al.
Published: (2024)
AEGIS: An Agent-based Framework for General Bug Reproduction from Issue Descriptions
by: Wang, Xinchen, et al.
Published: (2024)
Benchmarking Mythos-Linked Bug Rediscovery
by: David, Isaac, et al.
Published: (2026)
RLocator: Reinforcement Learning for Bug Localization
by: Chakraborty, Partha, et al.
Published: (2023)
RefExpo: Unveiling Software Project Structures through Advanced Dependency Graph Extraction
by: Haratian, Vahid, et al.
Published: (2024)
SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs
by: Pham, Minh V. T., et al.
Published: (2025)
The Future of Software Testing: AI-Powered Test Case Generation and Validation
by: Baqar, Mohammad, et al.
Published: (2024)
Evaluating LLM-Based Test Generation Under Software Evolution
by: Haroon, Sabaat, et al.
Published: (2026)
BLAgent: Agentic RAG for File-Level Bug Localization
by: Mamun, Md Afif Al, et al.
Published: (2026)
Self-Bootstrapping Automated Program Repair: Using LLMs to Generate and Evaluate Synthetic Training Data for Bug Repair
by: de-Fitero-Dominguez, David, et al.
Published: (2025)