:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	McAndrews, Charles Junichi
Format:	Preprint
Published:	2026
Subjects:	Software Engineering Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2604.21950
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Machine Learning Canvas: Empirical Findings on Why Strategy Matters More Than AI Code Generation
by: Prause, Martin
Published: (2026)

Improving Code Generation by Training with Natural Language Feedback
by: Chen, Angelica, et al.
Published: (2023)

PerfCodeGen: Improving Performance of LLM Generated Code with Execution Feedback
by: Peng, Yun, et al.
Published: (2024)

Semantic Voting: Execution-Grounded Consensus for LLM Code Generation
by: Jiang, Shan, et al.
Published: (2026)

Using a Feedback Loop for LLM-based Infrastructure as Code Generation
by: Palavalli, Mayur Amarnath, et al.
Published: (2024)

Position: Early-Stage Quality Assurance in Annotation Pipelines Is More Cost-Effective Than Late-Stage Validation
by: Kothari, Sunil, et al.
Published: (2026)

Assessing Large Language Models for Automated Feedback Generation in Learning Programming Problem Solving
by: Silva, Priscylla, et al.
Published: (2025)

Ambiguity Resolution with Human Feedback for Code Writing Tasks
by: Nandan, Aditey, et al.
Published: (2025)

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
by: Gu, Alex, et al.
Published: (2024)

1D-Bench: A Benchmark for Iterative UI Code Generation with Visual Feedback in Real-World
by: Xu, Qiao, et al.
Published: (2026)

Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis
by: Dolcetti, Greta, et al.
Published: (2024)

Enhancing LLM-Based Code Generation with Complexity Metrics: A Feedback-Driven Approach
by: Sepidband, Melika, et al.
Published: (2025)

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
by: Zhuo, Terry Yue, et al.
Published: (2025)

ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments
by: Han, Hojae, et al.
Published: (2025)

RedCode: Risky Code Execution and Generation Benchmark for Code Agents
by: Guo, Chengquan, et al.
Published: (2024)

A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement
by: Zhang, Huan, et al.
Published: (2024)

Vibe-Coding: Feedback-Based Automated Verification with no Human Code Inspection, a Feasibility Study
by: Töpfer, Michal, et al.
Published: (2026)

Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation
by: Yu, Zhuohao, et al.
Published: (2024)

Effective LLM Code Refinement via Property-Oriented and Structurally Minimal Feedback
by: He, Lehan, et al.
Published: (2025)

Coding with Eyes: Visual Feedback Unlocks Reliable GUI Code Generating and Debugging
by: Liu, Zhilin, et al.
Published: (2026)

More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents
by: Gao, Pengfei, et al.
Published: (2025)

Towards More Trustworthy and Interpretable LLMs for Code through Syntax-Grounded Explanations
by: Palacio, David N., et al.
Published: (2024)

SWE-PRBench: Benchmarking AI Code Review Quality Against Pull Request Feedback
by: Kumar, Deepak
Published: (2026)

EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback
by: Hajizadeh, Samira, et al.
Published: (2026)

Automatic Generation of Executable BPMN Models from Medical Guidelines
by: Sekar, Praveen Kumar Menaka, et al.
Published: (2026)

Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs
by: Sakharova, Marina, et al.
Published: (2025)

Executing as You Generate: Hiding Execution Latency in LLM Code Generation
by: Sun, Zhensu, et al.
Published: (2026)

Toward Executable Repository-Level Code Generation via Environment Alignment
by: Pan, Ruwei, et al.
Published: (2026)

AutoFeedback: An LLM-based Framework for Efficient and Accurate API Request Generation
by: Liu, Huanxi, et al.
Published: (2024)

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
by: Zheng, Tianyu, et al.
Published: (2024)

Benchmarking Large Language Models for ABAP Code Generation: An Empirical Study on Iterative Improvement by Compiler Feedback
by: Wallraven, Stephan, et al.
Published: (2026)

CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback
by: Sun, Qiushi, et al.
Published: (2025)

CodeTaste: Can LLMs Generate Human-Level Code Refactorings?
by: Thillen, Alex, et al.
Published: (2026)

Stack Trace Deduplication: Faster, More Accurately, and in More Realistic Scenarios
by: Shibaev, Egor, et al.
Published: (2024)

Mage: Multi-Axis Evaluation of LLM-Generated Executable Game Scenes Beyond Compile-Pass Rate
by: Liu, Hugh Xuechen, et al.
Published: (2026)

Repo2Run: Automated Building Executable Environment for Code Repository at Scale
by: Hu, Ruida, et al.
Published: (2025)

Optimizing AI-Assisted Code Generation
by: Torka, Simon, et al.
Published: (2024)

Operational Robustness of LLMs on Code Generation
by: Paul, Debalina Ghosh, et al.
Published: (2026)

Can Coding Agents Be General Agents?
by: Ivanov, Maksim, et al.
Published: (2026)

Beyond Execution: Static-Analysis Rewards and Hint-Conditioned Diffusion RL for Code Generation
by: Ouyang, Shuyin, et al.
Published: (2026)