| Main Authors: | Shen, Chen, Cheng, Wei, Yang, Jingyue, Zhang, Huan, Wu, Yuhan, Hu, Wei |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.06976 |
Similar Items
Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus
by: Zhang, Huan, et al.
Published: (2026)
PPM: Automated Generation of Diverse Programming Problems for Benchmarking Code Generation Models
by: Chen, Simin, et al.
Published: (2024)
DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI Data
by: Wang, Bin, et al.
Published: (2024)
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis
by: Wei, Anjiang, et al.
Published: (2025)
CodeMind: Evaluating Large Language Models for Code Reasoning
by: Liu, Changshu, et al.
Published: (2024)
AutoCode: LLMs as Problem Setters for Competitive Programming
by: Zhou, Shang, et al.
Published: (2025)
AI-Assisted Fixes to Code Review Comments at Scale
by: Maddila, Chandra, et al.
Published: (2025)
KiloBot: A Programming Language for Deploying Perception-Guided Industrial Manipulators at Scale
by: Gao, Wei, et al.
Published: (2024)
Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie Worksheets
by: Joshi, Harshit, et al.
Published: (2024)
Benchmarking LLM Code Generation for Audio Programming with Visual Dataflow Languages
by: Zhang, William, et al.
Published: (2024)
Bootstrapping Code Translation with Weighted Multilanguage Exploration
by: Wu, Yuhan, et al.
Published: (2026)
Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization
by: Wu, Yuhan, et al.
Published: (2026)
Can Language Models Solve Olympiad Programming?
by: Shi, Quan, et al.
Published: (2024)
Quokka: Accelerating Program Verification with LLMs via Invariant Synthesis
by: Wei, Anjiang, et al.
Published: (2025)
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff
by: Tang, Hao, et al.
Published: (2024)
SuperCoder: Assembly Program Superoptimization with Large Language Models
by: Wei, Anjiang, et al.
Published: (2025)
AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation
by: Sun, Zhensu, et al.
Published: (2024)
A Preliminary Study of Multilingual Code Language Models for Code Generation Task Using Translated Benchmarks
by: Dandamudi, Rohit, et al.
Published: (2024)
Program Machine Policy: Addressing Long-Horizon Tasks by Integrating Program Synthesis and State Machines
by: Lin, Yu-An, et al.
Published: (2023)
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators
by: Paul, Indraneil, et al.
Published: (2024)
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules
by: Le, Hung, et al.
Published: (2023)
Compiler-Guided Inference-Time Adaptation: Improving GPT-5 Programming Performance in Idris
by: Li, Minda, et al.
Published: (2026)
Emergent Representations of Program Semantics in Language Models Trained on Programs
by: Jin, Charles, et al.
Published: (2023)
Neural Task Synthesis for Visual Programming
by: Pădurean, Victor-Alexandru, et al.
Published: (2023)
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts
by: Dong, Honghua, et al.
Published: (2024)
VisCoder2: Building Multi-Language Visualization Coding Agents
by: Ni, Yuansheng, et al.
Published: (2025)
Probabilistic Programming with Programmable Variational Inference
by: Becker, McCoy R., et al.
Published: (2024)
Inference Plans for Hybrid Particle Filtering
by: Cheng, Ellie Y., et al.
Published: (2024)
EquiBench: Benchmarking Large Language Models' Reasoning about Program Semantics via Equivalence Checking
by: Wei, Anjiang, et al.
Published: (2025)
PDL: A Declarative Prompt Programming Language
by: Vaziri, Mandana, et al.
Published: (2024)
Unfamiliar Finetuning Examples Control How Language Models Hallucinate
by: Kang, Katie, et al.
Published: (2024)
Towards Repository-Level Program Verification with Large Language Models
by: Zhong, Si Cheng, et al.
Published: (2025)
CodeIF-Bench: Evaluating Instruction-Following Capabilities of Large Language Models in Interactive Code Generation
by: Wang, Peiding, et al.
Published: (2025)
Sharing State Between Prompts and Programs
by: Cheng, Ellie Y., et al.
Published: (2025)
MonoCoder: Domain-Specific Code Language Model for HPC Codes and Tasks
by: Kadosh, Tal, et al.
Published: (2023)
The IsalProgram Programming Language
by: López-Rubio, Ezequiel
Published: (2026)
CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text
by: Lin, Zhenru, et al.
Published: (2024)
Code Simulation Challenges for Large Language Models
by: La Malfa, Emanuele, et al.
Published: (2024)
AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents
by: Xu, Shuyuan, et al.
Published: (2024)
SGLang: Efficient Execution of Structured Language Model Programs
by: Zheng, Lianmin, et al.
Published: (2023)