:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	McMillan, Damon
Format:	Preprint
Published:	2026
Subjects:	Software Engineering Computation and Language
Online Access:	https://arxiv.org/abs/2605.10039
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Structured Context Engineering for File-Native Agentic Systems: Evaluating Schema Accuracy, Format Effectiveness, and Multi-File Navigation at Scale
by: McMillan, Damon
Published: (2026)

MultiFileTest: A Multi-File-Level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
by: Wang, Yibo, et al.
Published: (2025)

Agent READMEs: An Empirical Study of Context Files for Agentic Coding
by: Chatlatanagulchai, Worawalan, et al.
Published: (2025)

APIDA-Chat: Structured Synthesis of API Search Dialogues to Bootstrap Conversational Agents
by: Eberhart, Zachary, et al.
Published: (2025)

The Stability Trap: Evaluating the Reliability of LLM-Based Instruction Adherence Auditing
by: Shergadwala, Murtuza N.
Published: (2026)

Exploring Direct Instruction and Summary-Mediated Prompting in LLM-Assisted Code Modification
by: Tang, Ningzhi, et al.
Published: (2025)

Attention Mechanism and Heuristic Approach: Context-Aware File Ranking Using Multi-Head Self-Attention
by: Sharma, Pradeep Kumar, et al.
Published: (2026)

Do Code LLMs Do Static Analysis?
by: Su, Chia-Yi, et al.
Published: (2025)

CMind: An AI Agent for Localizing C Memory Bugs
by: Su, Chia-Yi, et al.
Published: (2026)

OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs
by: Ahmad, Wasi Uddin, et al.
Published: (2025)

Distilled GPT for Source Code Summarization
by: Su, Chia-Yi, et al.
Published: (2023)

Encrypted Container File: Design and Implementation of a Hybrid-Encrypted Multi-Recipient File Structure
by: Bauer, Tobias J., et al.
Published: (2024)

An Empirical Study of Dotfiles Repositories Containing User-Specific Configuration Files
by: Zhu, Wenhan, et al.
Published: (2025)

Semantic Similarity Loss for Neural Source Code Summarization
by: Su, Chia-Yi, et al.
Published: (2023)

AI-Mediated Code Comment Improvement
by: Dhakal, Maria, et al.
Published: (2025)

ISD-Agent-Bench: A Comprehensive Benchmark for Evaluating LLM-based Instructional Design Agents
by: Jeon, YoungHoon, et al.
Published: (2026)

ContextCov: Deriving and Enforcing Executable Constraints from Agent Instruction Files
by: Sharma, Reshabh K
Published: (2026)

InstructCoder: Instruction Tuning Large Language Models for Code Editing
by: Li, Kaixin, et al.
Published: (2023)

AgentPack: A Dataset of Code Changes, Co-Authored by Agents and Humans
by: Zi, Yangtian, et al.
Published: (2025)

Operationalizing Ethics for AI Agents: How Developers Encode Values into Repository Context Files
by: Treude, Christoph, et al.
Published: (2026)

From Output to Evaluation: Does Raw Instruction-Tuned Code LLMs Output Suffice for Fill-in-the-Middle Code Generation?
by: Ahmad, Wasi Uddin, et al.
Published: (2025)

From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
by: Yang, Jian, et al.
Published: (2025)

Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?
by: Gloaguen, Thibaud, et al.
Published: (2026)

Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation
by: Fu, Lingyue, et al.
Published: (2025)

Is Vibe Coding Safe? Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks
by: Zhao, Songwen, et al.
Published: (2025)

NALA_MAINZ at BLP-2025 Task 2: A Multi-agent Approach for Bangla Instruction to Python Code Generation
by: Saadi, Hossain Shaikh, et al.
Published: (2025)

From Completion to Editing: Unlocking Context-Aware Code Infilling via Search-and-Replace Instruction Tuning
by: Zhang, Jiajun, et al.
Published: (2026)

CodeScout: Contextual Problem Statement Enhancement for Software Agents
by: Suri, Manan, et al.
Published: (2026)

SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents
by: Wang, Yuhang, et al.
Published: (2026)

A Study on Developer Behaviors for Validating and Repairing LLM-Generated Code Using Eye Tracking and IDE Actions
by: Tang, Ningzhi, et al.
Published: (2024)

EffiSkill: Agent Skill Based Automated Code Efficiency Optimization
by: Wang, Zimu, et al.
Published: (2026)

Towards Exception Safety Code Generation with Intermediate Representation Agents Framework
by: Zhang, Xuanming, et al.
Published: (2024)

Leveraging LLMs for Multi-File DSL Code Generation: An Industrial Case Study
by: Chand, Sivajeet, et al.
Published: (2026)

Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents
by: Kuang, Jiayi, et al.
Published: (2025)

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
by: Zhuo, Terry Yue, et al.
Published: (2024)

ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
by: Liu, Kaiyuan, et al.
Published: (2025)

Measuring LLM Code Generation Stability via Structural Entropy
by: Song, Yewei, et al.
Published: (2025)

Agent-Diff: Benchmarking LLM Agents on Enterprise API Tasks via Code Execution with State-Diff-Based Evaluation
by: Pysklo, Hubert M., et al.
Published: (2026)

Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses
by: Lin, Jiahang, et al.
Published: (2026)

Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework
by: Zhang, Xuanming, et al.
Published: (2024)