Saved in:
| Main Author: | McMillan, Damon |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.10039 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Structured Context Engineering for File-Native Agentic Systems: Evaluating Schema Accuracy, Format Effectiveness, and Multi-File Navigation at Scale
by: McMillan, Damon
Published: (2026)
by: McMillan, Damon
Published: (2026)
MultiFileTest: A Multi-File-Level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
by: Wang, Yibo, et al.
Published: (2025)
by: Wang, Yibo, et al.
Published: (2025)
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
by: Chatlatanagulchai, Worawalan, et al.
Published: (2025)
by: Chatlatanagulchai, Worawalan, et al.
Published: (2025)
APIDA-Chat: Structured Synthesis of API Search Dialogues to Bootstrap Conversational Agents
by: Eberhart, Zachary, et al.
Published: (2025)
by: Eberhart, Zachary, et al.
Published: (2025)
The Stability Trap: Evaluating the Reliability of LLM-Based Instruction Adherence Auditing
by: Shergadwala, Murtuza N.
Published: (2026)
by: Shergadwala, Murtuza N.
Published: (2026)
Exploring Direct Instruction and Summary-Mediated Prompting in LLM-Assisted Code Modification
by: Tang, Ningzhi, et al.
Published: (2025)
by: Tang, Ningzhi, et al.
Published: (2025)
Attention Mechanism and Heuristic Approach: Context-Aware File Ranking Using Multi-Head Self-Attention
by: Sharma, Pradeep Kumar, et al.
Published: (2026)
by: Sharma, Pradeep Kumar, et al.
Published: (2026)
Do Code LLMs Do Static Analysis?
by: Su, Chia-Yi, et al.
Published: (2025)
by: Su, Chia-Yi, et al.
Published: (2025)
CMind: An AI Agent for Localizing C Memory Bugs
by: Su, Chia-Yi, et al.
Published: (2026)
by: Su, Chia-Yi, et al.
Published: (2026)
OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs
by: Ahmad, Wasi Uddin, et al.
Published: (2025)
by: Ahmad, Wasi Uddin, et al.
Published: (2025)
Distilled GPT for Source Code Summarization
by: Su, Chia-Yi, et al.
Published: (2023)
by: Su, Chia-Yi, et al.
Published: (2023)
Encrypted Container File: Design and Implementation of a Hybrid-Encrypted Multi-Recipient File Structure
by: Bauer, Tobias J., et al.
Published: (2024)
by: Bauer, Tobias J., et al.
Published: (2024)
An Empirical Study of Dotfiles Repositories Containing User-Specific Configuration Files
by: Zhu, Wenhan, et al.
Published: (2025)
by: Zhu, Wenhan, et al.
Published: (2025)
Semantic Similarity Loss for Neural Source Code Summarization
by: Su, Chia-Yi, et al.
Published: (2023)
by: Su, Chia-Yi, et al.
Published: (2023)
AI-Mediated Code Comment Improvement
by: Dhakal, Maria, et al.
Published: (2025)
by: Dhakal, Maria, et al.
Published: (2025)
ISD-Agent-Bench: A Comprehensive Benchmark for Evaluating LLM-based Instructional Design Agents
by: Jeon, YoungHoon, et al.
Published: (2026)
by: Jeon, YoungHoon, et al.
Published: (2026)
ContextCov: Deriving and Enforcing Executable Constraints from Agent Instruction Files
by: Sharma, Reshabh K
Published: (2026)
by: Sharma, Reshabh K
Published: (2026)
InstructCoder: Instruction Tuning Large Language Models for Code Editing
by: Li, Kaixin, et al.
Published: (2023)
by: Li, Kaixin, et al.
Published: (2023)
AgentPack: A Dataset of Code Changes, Co-Authored by Agents and Humans
by: Zi, Yangtian, et al.
Published: (2025)
by: Zi, Yangtian, et al.
Published: (2025)
Operationalizing Ethics for AI Agents: How Developers Encode Values into Repository Context Files
by: Treude, Christoph, et al.
Published: (2026)
by: Treude, Christoph, et al.
Published: (2026)
From Output to Evaluation: Does Raw Instruction-Tuned Code LLMs Output Suffice for Fill-in-the-Middle Code Generation?
by: Ahmad, Wasi Uddin, et al.
Published: (2025)
by: Ahmad, Wasi Uddin, et al.
Published: (2025)
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
by: Yang, Jian, et al.
Published: (2025)
by: Yang, Jian, et al.
Published: (2025)
Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?
by: Gloaguen, Thibaud, et al.
Published: (2026)
by: Gloaguen, Thibaud, et al.
Published: (2026)
Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation
by: Fu, Lingyue, et al.
Published: (2025)
by: Fu, Lingyue, et al.
Published: (2025)
Is Vibe Coding Safe? Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks
by: Zhao, Songwen, et al.
Published: (2025)
by: Zhao, Songwen, et al.
Published: (2025)
NALA_MAINZ at BLP-2025 Task 2: A Multi-agent Approach for Bangla Instruction to Python Code Generation
by: Saadi, Hossain Shaikh, et al.
Published: (2025)
by: Saadi, Hossain Shaikh, et al.
Published: (2025)
From Completion to Editing: Unlocking Context-Aware Code Infilling via Search-and-Replace Instruction Tuning
by: Zhang, Jiajun, et al.
Published: (2026)
by: Zhang, Jiajun, et al.
Published: (2026)
CodeScout: Contextual Problem Statement Enhancement for Software Agents
by: Suri, Manan, et al.
Published: (2026)
by: Suri, Manan, et al.
Published: (2026)
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents
by: Wang, Yuhang, et al.
Published: (2026)
by: Wang, Yuhang, et al.
Published: (2026)
A Study on Developer Behaviors for Validating and Repairing LLM-Generated Code Using Eye Tracking and IDE Actions
by: Tang, Ningzhi, et al.
Published: (2024)
by: Tang, Ningzhi, et al.
Published: (2024)
EffiSkill: Agent Skill Based Automated Code Efficiency Optimization
by: Wang, Zimu, et al.
Published: (2026)
by: Wang, Zimu, et al.
Published: (2026)
Towards Exception Safety Code Generation with Intermediate Representation Agents Framework
by: Zhang, Xuanming, et al.
Published: (2024)
by: Zhang, Xuanming, et al.
Published: (2024)
Leveraging LLMs for Multi-File DSL Code Generation: An Industrial Case Study
by: Chand, Sivajeet, et al.
Published: (2026)
by: Chand, Sivajeet, et al.
Published: (2026)
Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents
by: Kuang, Jiayi, et al.
Published: (2025)
by: Kuang, Jiayi, et al.
Published: (2025)
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
by: Zhuo, Terry Yue, et al.
Published: (2024)
by: Zhuo, Terry Yue, et al.
Published: (2024)
ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
by: Liu, Kaiyuan, et al.
Published: (2025)
by: Liu, Kaiyuan, et al.
Published: (2025)
Measuring LLM Code Generation Stability via Structural Entropy
by: Song, Yewei, et al.
Published: (2025)
by: Song, Yewei, et al.
Published: (2025)
Agent-Diff: Benchmarking LLM Agents on Enterprise API Tasks via Code Execution with State-Diff-Based Evaluation
by: Pysklo, Hubert M., et al.
Published: (2026)
by: Pysklo, Hubert M., et al.
Published: (2026)
Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses
by: Lin, Jiahang, et al.
Published: (2026)
by: Lin, Jiahang, et al.
Published: (2026)
Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework
by: Zhang, Xuanming, et al.
Published: (2024)
by: Zhang, Xuanming, et al.
Published: (2024)
Similar Items
-
Structured Context Engineering for File-Native Agentic Systems: Evaluating Schema Accuracy, Format Effectiveness, and Multi-File Navigation at Scale
by: McMillan, Damon
Published: (2026) -
MultiFileTest: A Multi-File-Level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
by: Wang, Yibo, et al.
Published: (2025) -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
by: Chatlatanagulchai, Worawalan, et al.
Published: (2025) -
APIDA-Chat: Structured Synthesis of API Search Dialogues to Bootstrap Conversational Agents
by: Eberhart, Zachary, et al.
Published: (2025) -
The Stability Trap: Evaluating the Reliability of LLM-Based Instruction Adherence Auditing
by: Shergadwala, Murtuza N.
Published: (2026)