Saved in:
| Main Authors: | Lin, Zi, Shen, Sheng, Kulikov, Ilia, Shang, Jingbo, Weston, Jason, Nie, Yixin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.14948 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SolSearch: An LLM-Driven Framework for Efficient SAT-Solving Code Generation
by: Sheng, Junjie, et al.
Published: (2025)
by: Sheng, Junjie, et al.
Published: (2025)
CodeDPO: Aligning Code Models with Self Generated and Verified Source Code
by: Zhang, Kechi, et al.
Published: (2024)
by: Zhang, Kechi, et al.
Published: (2024)
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step
by: Zhong, Li, et al.
Published: (2024)
by: Zhong, Li, et al.
Published: (2024)
Clover: Closed-Loop Verifiable Code Generation
by: Sun, Chuyue, et al.
Published: (2023)
by: Sun, Chuyue, et al.
Published: (2023)
An Iterative Test-and-Repair Framework for Competitive Code Generation
by: Tang, Lingxiao, et al.
Published: (2026)
by: Tang, Lingxiao, et al.
Published: (2026)
Revisit Self-Debugging with Self-Generated Tests for Code Generation
by: Chen, Xiancai, et al.
Published: (2025)
by: Chen, Xiancai, et al.
Published: (2025)
Self-planning Code Generation with Large Language Models
by: Jiang, Xue, et al.
Published: (2023)
by: Jiang, Xue, et al.
Published: (2023)
VerifyThisBench: Generating Code, Specifications, and Proofs All at Once
by: Deng, Xun, et al.
Published: (2025)
by: Deng, Xun, et al.
Published: (2025)
Efficient Incremental Code Coverage Analysis for Regression Test Suites
by: Wang, Jiale Amber, et al.
Published: (2024)
by: Wang, Jiale Amber, et al.
Published: (2024)
Understanding Self-Admitted Technical Debt in Test Code: An Empirical Study
by: Nakamura, Ibuki, et al.
Published: (2025)
by: Nakamura, Ibuki, et al.
Published: (2025)
PlayCoder: Making LLM-Generated GUI Code Playable
by: Peng, Zhiyuan, et al.
Published: (2026)
by: Peng, Zhiyuan, et al.
Published: (2026)
VeriScale: Adversarial Test-Suite Scaling for Verifiable Code Generation
by: Bai, Yifan, et al.
Published: (2026)
by: Bai, Yifan, et al.
Published: (2026)
DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs
by: Liu, Zhihan, et al.
Published: (2024)
by: Liu, Zhihan, et al.
Published: (2024)
WybeCoder: Verified Imperative Code Generation
by: Gloeckle, Fabian, et al.
Published: (2026)
by: Gloeckle, Fabian, et al.
Published: (2026)
DEFT: Differentiable Automatic Test Pattern Generation
by: Li, Wei, et al.
Published: (2025)
by: Li, Wei, et al.
Published: (2025)
Effective Random Test Generation for Deep Learning Compilers
by: Ren, Luyao, et al.
Published: (2023)
by: Ren, Luyao, et al.
Published: (2023)
CRScore++: Reinforcement Learning with Verifiable Tool and AI Feedback for Code Review
by: Kapadnis, Manav Nitin, et al.
Published: (2025)
by: Kapadnis, Manav Nitin, et al.
Published: (2025)
CasModaTest: A Cascaded and Model-agnostic Self-directed Framework for Unit Test Generation
by: Ni, Chao, et al.
Published: (2024)
by: Ni, Chao, et al.
Published: (2024)
CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation
by: Pan, Ruwei, et al.
Published: (2025)
by: Pan, Ruwei, et al.
Published: (2025)
Klear-CodeTest: Scalable Test Case Generation for Code Reinforcement Learning
by: Fu, Jia, et al.
Published: (2025)
by: Fu, Jia, et al.
Published: (2025)
ExecVerify: White-Box RL with Verifiable Stepwise Rewards for Code Execution Reasoning
by: Tang, Lingxiao, et al.
Published: (2026)
by: Tang, Lingxiao, et al.
Published: (2026)
Detect Repair Verify for Securing LLM Generated Code: A Multi-Language Empirical Study
by: Cheng, Cheng
Published: (2026)
by: Cheng, Cheng
Published: (2026)
Interleaved Learning and Exploration: A Self-Adaptive Fuzz Testing Framework for MLIR
by: Sun, Zeyu, et al.
Published: (2025)
by: Sun, Zeyu, et al.
Published: (2025)
ACE: Self-Evolving LLM Coding Framework via Adversarial Unit Test Generation and Preference Optimization
by: Huang, Yixu, et al.
Published: (2026)
by: Huang, Yixu, et al.
Published: (2026)
WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation
by: Xu, Mingde, et al.
Published: (2025)
by: Xu, Mingde, et al.
Published: (2025)
Detect--Repair--Verify for LLM-Generated Code: A Multi-Language, Multi-Granularity Empirical Study
by: Cheng, Cheng
Published: (2026)
by: Cheng, Cheng
Published: (2026)
VeCoGen: Automating Generation of Formally Verified C Code with Large Language Models
by: Sevenhuijsen, Merlijn, et al.
Published: (2024)
by: Sevenhuijsen, Merlijn, et al.
Published: (2024)
R2Code: A Self-Reflective LLM Framework for Requirements-to-Code Traceability
by: Wang, Yifei, et al.
Published: (2026)
by: Wang, Yifei, et al.
Published: (2026)
Breaking, Stale, or Missing? Benchmarking Coding Agents on Project-Level Test Evolution
by: Shang, Ye, et al.
Published: (2026)
by: Shang, Ye, et al.
Published: (2026)
BlueCodeAgent: A Blue Teaming Agent Enabled by Automated Red Teaming for CodeGen AI
by: Guo, Chengquan, et al.
Published: (2025)
by: Guo, Chengquan, et al.
Published: (2025)
ATLAS: Automated Toolkit for Large-Scale Verified Code Synthesis
by: Baksys, Mantas, et al.
Published: (2025)
by: Baksys, Mantas, et al.
Published: (2025)
TALM: Dynamic Tree-Structured Multi-Agent Framework with Long-Term Memory for Scalable Code Generation
by: Shen, Ming-Tung, et al.
Published: (2025)
by: Shen, Ming-Tung, et al.
Published: (2025)
Failure-Aware Enhancements for Large Language Model (LLM) Code Generation: An Empirical Study on Decision Framework
by: Shen, Jianru, et al.
Published: (2026)
by: Shen, Jianru, et al.
Published: (2026)
A Regression Testing Framework with Automated Assertion Generation for Machine Learning Notebooks
by: Yao, Yingao Elaine, et al.
Published: (2025)
by: Yao, Yingao Elaine, et al.
Published: (2025)
A First Look at the Self-Admitted Technical Debt in Test Code: Taxonomy and Detection
by: Islam, Shahidul, et al.
Published: (2025)
by: Islam, Shahidul, et al.
Published: (2025)
TENET: Leveraging Tests Beyond Validation for Code Generation
by: Hu, Yiran, et al.
Published: (2025)
by: Hu, Yiran, et al.
Published: (2025)
CYCLE: Learning to Self-Refine the Code Generation
by: Ding, Yangruibo, et al.
Published: (2024)
by: Ding, Yangruibo, et al.
Published: (2024)
Model Cascading for Code: A Cascaded Black-Box Multi-Model Framework for Cost-Efficient Code Completion with Self-Testing
by: Chen, Boyuan, et al.
Published: (2024)
by: Chen, Boyuan, et al.
Published: (2024)
Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK
by: Cramer, Marcos, et al.
Published: (2025)
by: Cramer, Marcos, et al.
Published: (2025)
Can LLMs Solve Science or Just Write Code? Evaluating Quantum Solver Generation
by: Baresi, Luciano, et al.
Published: (2026)
by: Baresi, Luciano, et al.
Published: (2026)
Similar Items
-
SolSearch: An LLM-Driven Framework for Efficient SAT-Solving Code Generation
by: Sheng, Junjie, et al.
Published: (2025) -
CodeDPO: Aligning Code Models with Self Generated and Verified Source Code
by: Zhang, Kechi, et al.
Published: (2024) -
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step
by: Zhong, Li, et al.
Published: (2024) -
Clover: Closed-Loop Verifiable Code Generation
by: Sun, Chuyue, et al.
Published: (2023) -
An Iterative Test-and-Repair Framework for Competitive Code Generation
by: Tang, Lingxiao, et al.
Published: (2026)