Saved in:
| Main Authors: | Chai, Linzheng, Yang, Jian, Liu, Shukai, Zhang, Wei, Wang, Liran, Jin, Ke, Sun, Tao, Liu, Congnan, Zhang, Chenchen, Zhu, Hualei, Liu, Jiaheng, Wu, Xianjie, Zhang, Ge, Liu, Tianyu, Li, Zhoujun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.08719 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
by: Liu, Jiaheng, et al.
Published: (2024)
by: Liu, Jiaheng, et al.
Published: (2024)
In-Context Code-Text Learning for Bimodal Software Engineering
by: Tang, Xunzhu, et al.
Published: (2024)
by: Tang, Xunzhu, et al.
Published: (2024)
R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models
by: Deng, Ken, et al.
Published: (2024)
by: Deng, Ken, et al.
Published: (2024)
CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-level Coding Challenges
by: Zhang, Kechi, et al.
Published: (2024)
by: Zhang, Kechi, et al.
Published: (2024)
Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining
by: Zeng, Zhiyuan, et al.
Published: (2026)
by: Zeng, Zhiyuan, et al.
Published: (2026)
Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering
by: Fu, Kelin, et al.
Published: (2025)
by: Fu, Kelin, et al.
Published: (2025)
GraphCodeAgent: Dual Graph-Guided LLM Agent for Retrieval-Augmented Repo-Level Code Generation
by: Li, Jia, et al.
Published: (2025)
by: Li, Jia, et al.
Published: (2025)
MdEval: Massively Multilingual Code Debugging
by: Liu, Shukai, et al.
Published: (2024)
by: Liu, Shukai, et al.
Published: (2024)
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
by: Yang, Jian, et al.
Published: (2025)
by: Yang, Jian, et al.
Published: (2025)
RealBench: A Repo-Level Code Generation Benchmark Aligned with Real-World Software Development Practices
by: Li, Jia, et al.
Published: (2026)
by: Li, Jia, et al.
Published: (2026)
aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion
by: Li, Jia, et al.
Published: (2025)
by: Li, Jia, et al.
Published: (2025)
WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models
by: Lei, Xinping, et al.
Published: (2026)
by: Lei, Xinping, et al.
Published: (2026)
CodeDPO: Aligning Code Models with Self Generated and Verified Source Code
by: Zhang, Kechi, et al.
Published: (2024)
by: Zhang, Kechi, et al.
Published: (2024)
Identifying Root Cause of bugs by Capturing Changed Code Lines with Relational Graph Neural Networks
by: Zhang, Jiaqi, et al.
Published: (2025)
by: Zhang, Jiaqi, et al.
Published: (2025)
Knowledge-Guided Multi-Agent Framework for Application-Level Software Code Generation
by: Xiong, Qian, et al.
Published: (2025)
by: Xiong, Qian, et al.
Published: (2025)
MLAD: A Unified Model for Multi-system Log Anomaly Detection
by: Zang, Runqiang, et al.
Published: (2024)
by: Zang, Runqiang, et al.
Published: (2024)
Software Development Life Cycle Perspective: A Survey of Benchmarks for Code Large Language Models and Agents
by: Wang, Kaixin, et al.
Published: (2025)
by: Wang, Kaixin, et al.
Published: (2025)
OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution
by: Guo, Lianghong, et al.
Published: (2025)
by: Guo, Lianghong, et al.
Published: (2025)
Primary Breadth-First Development (PBFD): An Approach to Full Stack Software Development
by: Liu, Dong
Published: (2025)
by: Liu, Dong
Published: (2025)
Write Your Own CodeChecker: An Automated Test-Driven Checker Development Approach with LLMs
by: Liu, Jun, et al.
Published: (2024)
by: Liu, Jun, et al.
Published: (2024)
McEval: Massively Multilingual Code Evaluation
by: Chai, Linzheng, et al.
Published: (2024)
by: Chai, Linzheng, et al.
Published: (2024)
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection
by: Guo, Hongcheng, et al.
Published: (2024)
by: Guo, Hongcheng, et al.
Published: (2024)
LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding
by: Li, Jia, et al.
Published: (2025)
by: Li, Jia, et al.
Published: (2025)
VFArchē: A Dual-Mode Framework for Locating Vulnerable Functions in Open-Source Software
by: Zhang, Lyuye, et al.
Published: (2025)
by: Zhang, Lyuye, et al.
Published: (2025)
A Viable Paradigm of Software Automation: Iterative End-to-End Automated Software Development
by: Li, Jia, et al.
Published: (2025)
by: Li, Jia, et al.
Published: (2025)
CodeMEM: AST-Guided Adaptive Memory for Repository-Level Iterative Code Generation
by: Wang, Peiding, et al.
Published: (2026)
by: Wang, Peiding, et al.
Published: (2026)
Requirements Development and Formalization for Reliable Code Generation: A Multi-Agent Vision
by: Lu, Xu, et al.
Published: (2025)
by: Lu, Xu, et al.
Published: (2025)
On the Road to Personalized Code Intelligence: Portraiting and Assisting Developers Based on Their In-IDE Behaviors
by: Liu, Yuhong, et al.
Published: (2026)
by: Liu, Yuhong, et al.
Published: (2026)
SweRank+: Multilingual, Multi-Turn Code Ranking for Software Issue Localization
by: Reddy, Revanth Gangi, et al.
Published: (2025)
by: Reddy, Revanth Gangi, et al.
Published: (2025)
Think Anywhere in Code Generation
by: Jiang, Xue, et al.
Published: (2026)
by: Jiang, Xue, et al.
Published: (2026)
Contextualized Code Pretraining for Code Generation
by: Liu, Chen, et al.
Published: (2026)
by: Liu, Chen, et al.
Published: (2026)
HiRoPE: Length Extrapolation for Code Models Using Hierarchical Position
by: Zhang, Kechi, et al.
Published: (2024)
by: Zhang, Kechi, et al.
Published: (2024)
ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback
by: Zhang, Wei, et al.
Published: (2024)
by: Zhang, Wei, et al.
Published: (2024)
Turning the Tide: Repository-based Code Reflection
by: Zhang, Wei, et al.
Published: (2025)
by: Zhang, Wei, et al.
Published: (2025)
ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Models for Code Generation
by: Jiang, Xue, et al.
Published: (2024)
by: Jiang, Xue, et al.
Published: (2024)
Drop the Golden Apples: Identifying Third-Party Reuse by DB-Less Software Composition Analysis
by: Zhang, Lyuye, et al.
Published: (2025)
by: Zhang, Lyuye, et al.
Published: (2025)
CodeChemist: Functional Knowledge Transfer for Low-Resource Code Generation via Test-Time Scaling
by: Wang, Kaixin, et al.
Published: (2025)
by: Wang, Kaixin, et al.
Published: (2025)
AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators
by: Chou, Jason, et al.
Published: (2025)
by: Chou, Jason, et al.
Published: (2025)
Development and Benchmarking of Multilingual Code Clone Detector
by: Zhu, Wenqing, et al.
Published: (2024)
by: Zhu, Wenqing, et al.
Published: (2024)
SEAlign: Alignment Training for Software Engineering Agent
by: Zhang, Kechi, et al.
Published: (2025)
by: Zhang, Kechi, et al.
Published: (2025)
Similar Items
-
M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
by: Liu, Jiaheng, et al.
Published: (2024) -
In-Context Code-Text Learning for Bimodal Software Engineering
by: Tang, Xunzhu, et al.
Published: (2024) -
R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models
by: Deng, Ken, et al.
Published: (2024) -
CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-level Coding Challenges
by: Zhang, Kechi, et al.
Published: (2024) -
Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining
by: Zeng, Zhiyuan, et al.
Published: (2026)