| Main Authors: | Han, Junxiao, Wang, Yarong, Gu, Xiaodong, Gao, Cuiyun, Wan, Yao, Han, Song, Lo, David, Deng, Shuiguang |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2508.09791 |
Similar Items
From LLMs to Agents: A Comparative Evaluation of LLMs and LLM-based Agents in Security Patch Detection
by: Han, Junxiao, et al.
Published: (2025)
CodeGlance: Understanding Code Reasoning Challenges in LLMs through Multi-Dimensional Feature Analysis
by: Wang, Yunkun, et al.
Published: (2026)
Exploring Parameter-Efficient Fine-Tuning of Large Language Model on Automated Program Repair
by: Li, Guochang, et al.
Published: (2024)
Revisiting Vulnerability Patch Localization: An Empirical Study and LLM-Based Solution
by: Xu, Haoran, et al.
Published: (2025)
ChatUniTest: A Framework for LLM-Based Test Generation
by: Chen, Yinghao, et al.
Published: (2023)
GrepRAG: An Empirical Study and Optimization of Grep-Like Retrieval for Code Completion
by: Wang, Baoyi, et al.
Published: (2026)
GRACE: Graph-Guided Repository-Aware Code Completion through Hierarchical Code Fusion
by: Wang, Xingliang, et al.
Published: (2025)
A Deep Dive into Retrieval-Augmented Generation for Code Completion: Experience on WeChat
by: Yang, Zezhou, et al.
Published: (2025)
An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities
by: Yang, Zezhou, et al.
Published: (2025)
EpiDroid: Dependency-Guided Recomposition for Deep State Discovery in Mobile GUI Testing
by: Song, Jiahui, et al.
Published: (2026)
Search-Based LLMs for Code Optimization
by: Gao, Shuzheng, et al.
Published: (2024)
SR-Eval: Evaluating LLMs on Code Generation under Stepwise Requirement Refinement
by: Zhan, Zexun, et al.
Published: (2025)
MigrateLib: a tool for end-to-end Python library migration
by: Islam, Mohayeminul, et al.
Published: (2025)
APIGen: Generative API Method Recommendation
by: Chen, Yujia, et al.
Published: (2024)
Benchmarking Multimodal LLMs on Code Generation for Complex Interactive Webpages
by: Wu, Fan, et al.
Published: (2026)
Can Vision-Language Models Handle Long-Context Code? An Empirical Study on Visual Compression
by: Zhong, Jianping, et al.
Published: (2026)
A Systematic Literature Review of Code Hallucinations in LLMs: Characterization, Mitigation Methods, Challenges, and Future Directions for Reliable AI
by: Gao, Cuiyun, et al.
Published: (2025)
SPENCER: Self-Adaptive Model Distillation for Efficient Code Retrieval
by: Gu, Wenchao, et al.
Published: (2025)
Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice
by: Hu, Ruida, et al.
Published: (2025)
Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios
by: Hu, Ruida, et al.
Published: (2026)
What Makes Good In-context Demonstrations for Code Intelligence Tasks with LLMs?
by: Gao, Shuzheng, et al.
Published: (2023)
Towards Mitigating API Hallucination in Code Generated by LLMs with Hierarchical Dependency Aware
by: Chen, Yujia, et al.
Published: (2025)
HCAG: Hierarchical Abstraction and Retrieval-Augmented Generation on Theoretical Repositories with LLMs
by: Wu, Yusen, et al.
Published: (2026)
ComplexCodeEval: A Benchmark for Evaluating Large Code Models on More Complex Code
by: Feng, Jia, et al.
Published: (2024)
When Model Editing Meets Service Evolution: A Knowledge-Update Perspective for Service Recommendation
by: Fan, Guodong, et al.
Published: (2026)
A Closer Look into Transformer-Based Code Intelligence Through Code Transformation: Challenges and Opportunities
by: Li, Yaoxian, et al.
Published: (2022)
Schedule-and-Calibrate: Utility-Guided Multi-Task Reinforcement Learning for Code LLMs
by: Chen, Yujia, et al.
Published: (2026)
VecIntrinBench: Benchmarking Cross-Architecture Intrinsic Code Migration for RISC-V Vector
by: Han, Liutong, et al.
Published: (2025)
SEER: Enhancing Chain-of-Thought Code Generation through Self-Exploring Deep Reasoning
by: Gao, Shuzheng, et al.
Published: (2025)
Boosting Vulnerability Detection of LLMs via Curriculum Preference Optimization with Synthetic Reasoning Data
by: Wen, Xin-Cheng, et al.
Published: (2025)
VulEval: Towards Repository-Level Evaluation of Software Vulnerability Detection
by: Wen, Xin-Cheng, et al.
Published: (2024)
A Real-World Benchmark for Evaluating Fine-Grained Issue Solving Capabilities of Large Language Models
by: Hu, Ruida, et al.
Published: (2024)
AndroLibZoo: A Reliable Dataset of Libraries Based on Software Dependency Analysis
by: Samhi, Jordan, et al.
Published: (2023)
CatchAll: Repository-Aware Exception Handling with Knowledge-Guided LLMs
by: Tao, Qingxiao, et al.
Published: (2026)
The Current Challenges of Software Engineering in the Era of Large Language Models
by: Gao, Cuiyun, et al.
Published: (2024)
Analyzing C/C++ Library Migrations at the Package-level: Prevalence, Domains, Targets and Rationals across Seven Package Management Tools
by: Gu, Haiqiao, et al.
Published: (2025)
APIRAT: Integrating Multi-source API Knowledge for Enhanced Code Translation with LLMs
by: Wang, Chaofan, et al.
Published: (2025)
RAG or Fine-tuning? A Comparative Study on LCMs-based Code Completion in Industry
by: Wang, Chaozheng, et al.
Published: (2025)
Automated Prompt Generation for Code Intelligence: An Empirical study and Experience in WeChat
by: Ji, Kexing, et al.
Published: (2025)
LibEvolutionEval: A Benchmark and Study for Version-Specific Code Generation
by: Kuhar, Sachit, et al.
Published: (2024)