:: Library Catalog

Bibliographic Details
Main Authors:	Liang, Xiaoyun; Ren, Jingyi; Qi, Jiayi; Peng, Chao; Jiang, Bo
Format:	Preprint
Published:	2024
Subjects:	Software Engineering; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.08069

Similar Items

CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering
by: Hu, Ruida, et al.
Published: (2024)

Evaluating Repository-level Software Documentation via Question Answering and Feature-Driven Development
by: Wang, Xinchen, et al.
Published: (2026)

MarsCode Agent: AI-native Automated Bug Fixing
by: Liu, Yizhou, et al.
Published: (2024)

Beyond Code Snippets: Benchmarking LLMs on Repository-Level Question Answering
by: Alebachew, Yoseph Berhanu, et al.
Published: (2026)

More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents
by: Gao, Pengfei, et al.
Published: (2025)

Rethinking the Value of Agent-Generated Tests for LLM-Based Software Engineering Agents
by: Chen, Zhi, et al.
Published: (2026)

RedCode: Risky Code Execution and Generation Benchmark for Code Agents
by: Guo, Chengquan, et al.
Published: (2024)

SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion
by: Ma, George, et al.
Published: (2025)

Robustness and Reasoning Fidelity of Large Language Models in Long-Context Code Question Answering
by: Maharaj, Kishan, et al.
Published: (2026)

Multimodal Auto Validation For Self-Refinement in Web Agents
by: Azam, Ruhana, et al.
Published: (2024)

Auto-SPT: Automating Semantic Preserving Transformations for Code
by: Hooda, Ashish, et al.
Published: (2025)

SEMAG: Self-Evolutionary Multi-Agent Code Generation
by: Peng, Yulin, et al.
Published: (2026)

Reducing Cost of LLM Agents with Trajectory Reduction
by: Xiao, Yuan-An, et al.
Published: (2025)

Code Review Agent Benchmark
by: Zhang, Yuntong, et al.
Published: (2026)

RepoReviewer: A Local-First Multi-Agent Architecture for Repository-Level Code Review
by: Zhang, Peng
Published: (2026)

AutoSafeCoder: A Multi-Agent Framework for Securing LLM Code Generation through Static Analysis and Fuzz Testing
by: Nunez, Ana, et al.
Published: (2024)

AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers
by: Lin, Zijie, et al.
Published: (2025)

How Do Agents Perform Code Optimization? An Empirical Study
by: Peng, Huiyun, et al.
Published: (2025)

An Empirical Study on LLM-based Agents for Automated Bug Fixing
by: Meng, Xiangxin, et al.
Published: (2024)

Workflows vs Agents for Code Translation
by: Gray, Henry, et al.
Published: (2025)

The Conversations Beneath the Code: Triadic Data for Long-Horizon Software Engineering Agents
by: Kim, Yelin
Published: (2026)

AInsteinBench: Benchmarking Coding Agents on Scientific Repositories
by: Duston, Titouan, et al.
Published: (2025)

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale
by: Phan, Huy Nhat, et al.
Published: (2024)

DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems
by: Ma, Ming, et al.
Published: (2025)

LocAgent: Graph-Guided LLM Agents for Code Localization
by: Chen, Zhaoling, et al.
Published: (2025)

AutoMCQ -- Automatically Generate Code Comprehension Questions using GenAI
by: Goodfellow, Martin, et al.
Published: (2025)

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
by: Ni, Ziyi, et al.
Published: (2025)

Beyond Local Code Optimization: Multi-Agent Reasoning for Software System Optimization
by: Peng, Huiyun, et al.
Published: (2026)

Can Coding Agents Be General Agents?
by: Ivanov, Maksim, et al.
Published: (2026)

Improving Performance of Commercially Available AI Products in a Multi-Agent Configuration
by: Hymel, Cory, et al.
Published: (2024)

DeepCode: Open Agentic Coding
by: Li, Zongwei, et al.
Published: (2025)

AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents
by: Pautsch, Erik, et al.
Published: (2025)

Scaling Coding Agents via Atomic Skills
by: Ma, Yingwei, et al.
Published: (2026)

Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice
by: Hu, Ruida, et al.
Published: (2025)

Theory of Code Space: Do Code Agents Understand Software Architecture?
by: Sapunov, Grigory
Published: (2026)

CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation
by: Wang, Xinchen, et al.
Published: (2025)

AutoICE: Automatically Synthesizing Verifiable C Code via LLM-driven Evolution
by: Luo, Weilin, et al.
Published: (2025)

COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis
by: Yang, Weiqing, et al.
Published: (2024)

TransAgent: Enhancing LLM-Based Code Translation via Fine-Grained Execution Alignment
by: Yuan, Zhiqiang, et al.
Published: (2024)

AutoCodeRover: Autonomous Program Improvement
by: Zhang, Yuntong, et al.
Published: (2024)