:: Library Catalog

Bibliographic Details
Main Authors:	Zhong, Y., Huang, R., Wang, M., Guo, Z., Li, YC., Yu, M., Jin, Z.
Format:	Preprint
Published:	2026
Subjects:	Software Engineering; Artificial Intelligence; Computation and Language
Online Access:	https://arxiv.org/abs/2603.03180

Similar Items

Python Symbolic Execution with LLM-powered Code Generation
by: Wang, Wenhan, et al.
Published: (2024)

Investigating Execution-Aware Language Models for Code Optimization
by: Di Menna, Federico, et al.
Published: (2025)

CodeRAG-Bench: Can Retrieval Augment Code Generation?
by: Wang, Zora Zhiruo, et al.
Published: (2024)

DuET: Dual Execution for Test Output Prediction with Generated Code and Pseudocode
by: Han, Hojae, et al.
Published: (2026)

Optimizing Code Runtime Performance through Context-Aware Retrieval-Augmented Generation
by: Acharya, Manish, et al.
Published: (2025)

EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning
by: Huang, Dong, et al.
Published: (2024)

MathDuels: Evaluating LLMs as Problem Posers and Solvers
by: Xu, Zhiqiu, et al.
Published: (2026)

Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches
by: Tao, Yicheng, et al.
Published: (2025)

CodeSpecBench: Benchmarking LLMs for Executable Behavioral Specification Generation
by: Chen, Zaoyu, et al.
Published: (2026)

Execution-Aware Program Reduction for WebAssembly via Record and Replay
by: Baek, Doehyun, et al.
Published: (2025)

DependEval: Benchmarking LLMs for Repository Dependency Understanding
by: Du, Junjia, et al.
Published: (2025)

EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization
by: Huang, Dong, et al.
Published: (2024)

Nexus: Execution-Grounded Multi-Agent Test Oracle Synthesis
by: Huang, Dong, et al.
Published: (2025)

Executing as You Generate: Hiding Execution Latency in LLM Code Generation
by: Sun, Zhensu, et al.
Published: (2026)

Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion
by: Cheng, Wei, et al.
Published: (2024)

GenX: Mastering Code and Test Generation with Execution Feedback
by: Wang, Nan, et al.
Published: (2024)

Solver-Independent Automated Problem Formulation via LLMs for High-Cost Simulation-Driven Design
by: Li, Yuchen, et al.
Published: (2025)

KAIJU: An Executive Kernel for Intent-Gated Execution of LLM Agents
by: Guerin, Cormac, et al.
Published: (2026)

Defusing Logic Bombs in Symbolic Execution with LLM-Generated Ghost Code
by: Bouras, Dimitrios Stamatios, et al.
Published: (2026)

CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment
by: Jiang, Xue, et al.
Published: (2025)

CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks
by: Xie, Yiqing, et al.
Published: (2024)

Evaluating Retrieval-Augmented Generation Variants for Natural Language-Based SQL and API Call Generation
by: Marketsmüller, Michael, et al.
Published: (2026)

Towards Automated Smart Contract Generation: Evaluation, Benchmarking, and Retrieval-Augmented Repair
by: Chen, Zaoyu, et al.
Published: (2025)

ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation
by: Liu, Sicong, et al.
Published: (2026)

Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step
by: Zhong, Li, et al.
Published: (2024)

StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning
by: Wang, Hao, et al.
Published: (2026)

Can Large Language Models Simulate Symbolic Execution Output Like KLEE?
by: Feng, Rong, et al.
Published: (2025)

CUTECat: Concolic Execution for Computational Law
by: Goutagny, Pierre, et al.
Published: (2024)

Illocutionary Explanation Planning for Source-Faithful Explanations in Retrieval-Augmented Language Models
by: Sovrano, Francesco, et al.
Published: (2026)

RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models
by: Wang, Zefan, et al.
Published: (2023)

From SWE-ZERO to SWE-HERO: Execution-free to Execution-based Fine-tuning for Software Engineering Agents
by: Ludwig, Nikolai, et al.
Published: (2026)

DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale
by: Zhang, Linghao, et al.
Published: (2025)

VERT: Verified Equivalent Rust Transpilation with Large Language Models as Few-Shot Learners
by: Yang, Aidan Z. H., et al.
Published: (2024)

Agent-Diff: Benchmarking LLM Agents on Enterprise API Tasks via Code Execution with State-Diff-Based Evaluation
by: Pysklo, Hubert M., et al.
Published: (2026)

Multi-Pass Targeted Dynamic Symbolic Execution
by: Yavuz, Tuba
Published: (2024)

Typing Requirement Model as Coroutines
by: Gu, Qiqi, et al.
Published: (2024)

TypePro: Boosting LLM-Based Type Inference via Inter-Procedural Slicing
by: Lin, Teyu, et al.
Published: (2026)

ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing
by: Zhang, Wei, et al.
Published: (2024)

Efficient Symbolic Execution of Software under Fault Attacks
by: Fang, Yuzhou, et al.
Published: (2025)

Generating Test Scenarios from NL Requirements using Retrieval-Augmented LLMs: An Industrial Study
by: Arora, Chetan, et al.
Published: (2024)