:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Han, Junxiao, Wang, Yarong, Gu, Xiaodong, Gao, Cuiyun, Wan, Yao, Han, Song, Lo, David, Deng, Shuiguang
Format:	Preprint
Published:	2025
Subjects:	Software Engineering Artificial Intelligence
Online Access:	https://arxiv.org/abs/2508.09791
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

From LLMs to Agents: A Comparative Evaluation of LLMs and LLM-based Agents in Security Patch Detection
by: Han, Junxiao, et al.
Published: (2025)

CodeGlance: Understanding Code Reasoning Challenges in LLMs through Multi-Dimensional Feature Analysis
by: Wang, Yunkun, et al.
Published: (2026)

Exploring Parameter-Efficient Fine-Tuning of Large Language Model on Automated Program Repair
by: Li, Guochang, et al.
Published: (2024)

Revisiting Vulnerability Patch Localization: An Empirical Study and LLM-Based Solution
by: Xu, Haoran, et al.
Published: (2025)

ChatUniTest: A Framework for LLM-Based Test Generation
by: Chen, Yinghao, et al.
Published: (2023)

GrepRAG: An Empirical Study and Optimization of Grep-Like Retrieval for Code Completion
by: Wang, Baoyi, et al.
Published: (2026)

GRACE: Graph-Guided Repository-Aware Code Completion through Hierarchical Code Fusion
by: Wang, Xingliang, et al.
Published: (2025)

A Deep Dive into Retrieval-Augmented Generation for Code Completion: Experience on WeChat
by: Yang, Zezhou, et al.
Published: (2025)

An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities
by: Yang, Zezhou, et al.
Published: (2025)

EpiDroid: Dependency-Guided Recomposition for Deep State Discovery in Mobile GUI Testing
by: Song, Jiahui, et al.
Published: (2026)

Search-Based LLMs for Code Optimization
by: Gao, Shuzheng, et al.
Published: (2024)

SR-Eval: Evaluating LLMs on Code Generation under Stepwise Requirement Refinement
by: Zhan, Zexun, et al.
Published: (2025)

MigrateLib: a tool for end-to-end Python library migration
by: Islam, Mohayeminul, et al.
Published: (2025)

APIGen: Generative API Method Recommendation
by: Chen, Yujia, et al.
Published: (2024)

Benchmarking Multimodal LLMs on Code Generation for Complex Interactive Webpages
by: Wu, Fan, et al.
Published: (2026)

Can Vision-Language Models Handle Long-Context Code? An Empirical Study on Visual Compression
by: Zhong, Jianping, et al.
Published: (2026)

A Systematic Literature Review of Code Hallucinations in LLMs: Characterization, Mitigation Methods, Challenges, and Future Directions for Reliable AI
by: Gao, Cuiyun, et al.
Published: (2025)

SPENCER: Self-Adaptive Model Distillation for Efficient Code Retrieval
by: Gu, Wenchao, et al.
Published: (2025)

Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice
by: Hu, Ruida, et al.
Published: (2025)

Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios
by: Hu, Ruida, et al.
Published: (2026)

What Makes Good In-context Demonstrations for Code Intelligence Tasks with LLMs?
by: Gao, Shuzheng, et al.
Published: (2023)

Towards Mitigating API Hallucination in Code Generated by LLMs with Hierarchical Dependency Aware
by: Chen, Yujia, et al.
Published: (2025)

HCAG: Hierarchical Abstraction and Retrieval-Augmented Generation on Theoretical Repositories with LLMs
by: Wu, Yusen, et al.
Published: (2026)

ComplexCodeEval: A Benchmark for Evaluating Large Code Models on More Complex Code
by: Feng, Jia, et al.
Published: (2024)

When Model Editing Meets Service Evolution: A Knowledge-Update Perspective for Service Recommendation
by: Fan, Guodong, et al.
Published: (2026)

A Closer Look into Transformer-Based Code Intelligence Through Code Transformation: Challenges and Opportunities
by: Li, Yaoxian, et al.
Published: (2022)

Schedule-and-Calibrate: Utility-Guided Multi-Task Reinforcement Learning for Code LLMs
by: Chen, Yujia, et al.
Published: (2026)

VecIntrinBench: Benchmarking Cross-Architecture Intrinsic Code Migration for RISC-V Vector
by: Han, Liutong, et al.
Published: (2025)

SEER: Enhancing Chain-of-Thought Code Generation through Self-Exploring Deep Reasoning
by: Gao, Shuzheng, et al.
Published: (2025)

Boosting Vulnerability Detection of LLMs via Curriculum Preference Optimization with Synthetic Reasoning Data
by: Wen, Xin-Cheng, et al.
Published: (2025)

VulEval: Towards Repository-Level Evaluation of Software Vulnerability Detection
by: Wen, Xin-Cheng, et al.
Published: (2024)

A Real-World Benchmark for Evaluating Fine-Grained Issue Solving Capabilities of Large Language Models
by: Hu, Ruida, et al.
Published: (2024)

AndroLibZoo: A Reliable Dataset of Libraries Based on Software Dependency Analysis
by: Samhi, Jordan, et al.
Published: (2023)

CatchAll: Repository-Aware Exception Handling with Knowledge-Guided LLMs
by: Tao, Qingxiao, et al.
Published: (2026)

The Current Challenges of Software Engineering in the Era of Large Language Models
by: Gao, Cuiyun, et al.
Published: (2024)

Analyzing C/C++ Library Migrations at the Package-level: Prevalence, Domains, Targets and Rationals across Seven Package Management Tools
by: Gu, Haiqiao, et al.
Published: (2025)

APIRAT: Integrating Multi-source API Knowledge for Enhanced Code Translation with LLMs
by: Wang, Chaofan, et al.
Published: (2025)

RAG or Fine-tuning? A Comparative Study on LCMs-based Code Completion in Industry
by: Wang, Chaozheng, et al.
Published: (2025)

Automated Prompt Generation for Code Intelligence: An Empirical study and Experience in WeChat
by: Ji, Kexing, et al.
Published: (2025)

LibEvolutionEval: A Benchmark and Study for Version-Specific Code Generation
by: Kuhar, Sachit, et al.
Published: (2024)