Similar Items
Python Symbolic Execution with LLM-powered Code Generation
by: Wang, Wenhan, et al.
Published: (2024)
by: Wang, Wenhan, et al.
Published: (2024)
Investigating Execution-Aware Language Models for Code Optimization
by: Di Menna, Federico, et al.
Published: (2025)
by: Di Menna, Federico, et al.
Published: (2025)
CodeRAG-Bench: Can Retrieval Augment Code Generation?
by: Wang, Zora Zhiruo, et al.
Published: (2024)
by: Wang, Zora Zhiruo, et al.
Published: (2024)
DuET: Dual Execution for Test Output Prediction with Generated Code and Pseudocode
by: Han, Hojae, et al.
Published: (2026)
by: Han, Hojae, et al.
Published: (2026)
Optimizing Code Runtime Performance through Context-Aware Retrieval-Augmented Generation
by: Acharya, Manish, et al.
Published: (2025)
by: Acharya, Manish, et al.
Published: (2025)
EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning
by: Huang, Dong, et al.
Published: (2024)
by: Huang, Dong, et al.
Published: (2024)
MathDuels: Evaluating LLMs as Problem Posers and Solvers
by: Xu, Zhiqiu, et al.
Published: (2026)
by: Xu, Zhiqiu, et al.
Published: (2026)
Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches
by: Tao, Yicheng, et al.
Published: (2025)
by: Tao, Yicheng, et al.
Published: (2025)
CodeSpecBench: Benchmarking LLMs for Executable Behavioral Specification Generation
by: Chen, Zaoyu, et al.
Published: (2026)
by: Chen, Zaoyu, et al.
Published: (2026)
Execution-Aware Program Reduction for WebAssembly via Record and Replay
by: Baek, Doehyun, et al.
Published: (2025)
by: Baek, Doehyun, et al.
Published: (2025)
DependEval: Benchmarking LLMs for Repository Dependency Understanding
by: Du, Junjia, et al.
Published: (2025)
by: Du, Junjia, et al.
Published: (2025)
EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization
by: Huang, Dong, et al.
Published: (2024)
by: Huang, Dong, et al.
Published: (2024)
Nexus: Execution-Grounded Multi-Agent Test Oracle Synthesis
by: Huang, Dong, et al.
Published: (2025)
by: Huang, Dong, et al.
Published: (2025)
Executing as You Generate: Hiding Execution Latency in LLM Code Generation
by: Sun, Zhensu, et al.
Published: (2026)
by: Sun, Zhensu, et al.
Published: (2026)
Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion
by: Cheng, Wei, et al.
Published: (2024)
by: Cheng, Wei, et al.
Published: (2024)
GenX: Mastering Code and Test Generation with Execution Feedback
by: Wang, Nan, et al.
Published: (2024)
by: Wang, Nan, et al.
Published: (2024)
Solver-Independent Automated Problem Formulation via LLMs for High-Cost Simulation-Driven Design
by: Li, Yuchen, et al.
Published: (2025)
by: Li, Yuchen, et al.
Published: (2025)
KAIJU: An Executive Kernel for Intent-Gated Execution of LLM Agents
by: Guerin, Cormac, et al.
Published: (2026)
by: Guerin, Cormac, et al.
Published: (2026)
Defusing Logic Bombs in Symbolic Execution with LLM-Generated Ghost Code
by: Bouras, Dimitrios Stamatios, et al.
Published: (2026)
by: Bouras, Dimitrios Stamatios, et al.
Published: (2026)
CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment
by: Jiang, Xue, et al.
Published: (2025)
by: Jiang, Xue, et al.
Published: (2025)
CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks
by: Xie, Yiqing, et al.
Published: (2024)
by: Xie, Yiqing, et al.
Published: (2024)
Evaluating Retrieval-Augmented Generation Variants for Natural Language-Based SQL and API Call Generation
by: Marketsmüller, Michael, et al.
Published: (2026)
by: Marketsmüller, Michael, et al.
Published: (2026)
Towards Automated Smart Contract Generation: Evaluation, Benchmarking, and Retrieval-Augmented Repair
by: Chen, Zaoyu, et al.
Published: (2025)
by: Chen, Zaoyu, et al.
Published: (2025)
ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation
by: Liu, Sicong, et al.
Published: (2026)
by: Liu, Sicong, et al.
Published: (2026)
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step
by: Zhong, Li, et al.
Published: (2024)
by: Zhong, Li, et al.
Published: (2024)
StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning
by: Wang, Hao, et al.
Published: (2026)
by: Wang, Hao, et al.
Published: (2026)
Can Large Language Models Simulate Symbolic Execution Output Like KLEE?
by: Feng, Rong, et al.
Published: (2025)
by: Feng, Rong, et al.
Published: (2025)
CUTECat: Concolic Execution for Computational Law
by: Goutagny, Pierre, et al.
Published: (2024)
by: Goutagny, Pierre, et al.
Published: (2024)
Illocutionary Explanation Planning for Source-Faithful Explanations in Retrieval-Augmented Language Models
by: Sovrano, Francesco, et al.
Published: (2026)
by: Sovrano, Francesco, et al.
Published: (2026)
RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models
by: Wang, Zefan, et al.
Published: (2023)
by: Wang, Zefan, et al.
Published: (2023)
From SWE-ZERO to SWE-HERO: Execution-free to Execution-based Fine-tuning for Software Engineering Agents
by: Ludwig, Nikolai, et al.
Published: (2026)
by: Ludwig, Nikolai, et al.
Published: (2026)
DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale
by: Zhang, Linghao, et al.
Published: (2025)
by: Zhang, Linghao, et al.
Published: (2025)
VERT: Verified Equivalent Rust Transpilation with Large Language Models as Few-Shot Learners
by: Yang, Aidan Z. H., et al.
Published: (2024)
by: Yang, Aidan Z. H., et al.
Published: (2024)
Agent-Diff: Benchmarking LLM Agents on Enterprise API Tasks via Code Execution with State-Diff-Based Evaluation
by: Pysklo, Hubert M., et al.
Published: (2026)
by: Pysklo, Hubert M., et al.
Published: (2026)
Multi-Pass Targeted Dynamic Symbolic Execution
by: Yavuz, Tuba
Published: (2024)
by: Yavuz, Tuba
Published: (2024)
Typing Requirement Model as Coroutines
by: Gu, Qiqi, et al.
Published: (2024)
by: Gu, Qiqi, et al.
Published: (2024)
TypePro: Boosting LLM-Based Type Inference via Inter-Procedural Slicing
by: Lin, Teyu, et al.
Published: (2026)
by: Lin, Teyu, et al.
Published: (2026)
ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing
by: Zhang, Wei, et al.
Published: (2024)
by: Zhang, Wei, et al.
Published: (2024)
Efficient Symbolic Execution of Software under Fault Attacks
by: Fang, Yuzhou, et al.
Published: (2025)
by: Fang, Yuzhou, et al.
Published: (2025)
Generating Test Scenarios from NL Requirements using Retrieval-Augmented LLMs: An Industrial Study
by: Arora, Chetan, et al.
Published: (2024)
by: Arora, Chetan, et al.
Published: (2024)
Similar Items
-
Python Symbolic Execution with LLM-powered Code Generation
by: Wang, Wenhan, et al.
Published: (2024) -
Investigating Execution-Aware Language Models for Code Optimization
by: Di Menna, Federico, et al.
Published: (2025) -
CodeRAG-Bench: Can Retrieval Augment Code Generation?
by: Wang, Zora Zhiruo, et al.
Published: (2024) -
DuET: Dual Execution for Test Output Prediction with Generated Code and Pseudocode
by: Han, Hojae, et al.
Published: (2026) -
Optimizing Code Runtime Performance through Context-Aware Retrieval-Augmented Generation
by: Acharya, Manish, et al.
Published: (2025)