Saved in:
| Main Authors: | Xu, Manjie, Yin, Isabella, Tu, Xinyi, Zhang, Chi, Zhu, Yixin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.18352 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning to Plan with Personalized Preferences
by: Xu, Manjie, et al.
Published: (2025)
by: Xu, Manjie, et al.
Published: (2025)
Code Execution as Grounded Supervision for LLM Reasoning
by: Jung, Dongwon, et al.
Published: (2025)
by: Jung, Dongwon, et al.
Published: (2025)
Boosting Chart-to-Code Generation in MLLM via Dual Preference-Guided Refinement
by: Zhang, Zhihan, et al.
Published: (2025)
by: Zhang, Zhihan, et al.
Published: (2025)
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
by: Li, Junlong, et al.
Published: (2025)
by: Li, Junlong, et al.
Published: (2025)
Heterogeneous Adversarial Play in Interactive Environments
by: Xu, Manjie, et al.
Published: (2025)
by: Xu, Manjie, et al.
Published: (2025)
Improving LLM Code Reasoning via Semantic Equivalence Self-Play with Formal Verification
by: Barone, Antonio Valerio Miceli, et al.
Published: (2026)
by: Barone, Antonio Valerio Miceli, et al.
Published: (2026)
CodeGraph: Enhancing Graph Reasoning of LLMs with Code
by: Cai, Qiaolong, et al.
Published: (2024)
by: Cai, Qiaolong, et al.
Published: (2024)
Context Pruning for Coding Agents via Multi-Rubric Latent Reasoning
by: Wang, Jingjing, et al.
Published: (2026)
by: Wang, Jingjing, et al.
Published: (2026)
R1-Code-Interpreter: LLMs Reason with Code via Supervised and Multi-stage Reinforcement Learning
by: Chen, Yongchao, et al.
Published: (2025)
by: Chen, Yongchao, et al.
Published: (2025)
CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment
by: Jiang, Xue, et al.
Published: (2025)
by: Jiang, Xue, et al.
Published: (2025)
Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding
by: Di, Yifeng, et al.
Published: (2025)
by: Di, Yifeng, et al.
Published: (2025)
CRANE: Constrained Reasoning Injection for Code Agents via Nullspace Editing
by: Zhu, Mingzhi, et al.
Published: (2026)
by: Zhu, Mingzhi, et al.
Published: (2026)
Controlled Self-Evolution for Algorithmic Code Optimization
by: Hu, Tu, et al.
Published: (2026)
by: Hu, Tu, et al.
Published: (2026)
SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning
by: Ding, Yangruibo, et al.
Published: (2024)
by: Ding, Yangruibo, et al.
Published: (2024)
Code-driven Number Sequence Calculation: Enhancing the inductive Reasoning Abilities of Large Language Models
by: Chen, Kedi, et al.
Published: (2025)
by: Chen, Kedi, et al.
Published: (2025)
CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning
by: Yu, Huimu, et al.
Published: (2024)
by: Yu, Huimu, et al.
Published: (2024)
DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation
by: Hu, Wenhao, et al.
Published: (2025)
by: Hu, Wenhao, et al.
Published: (2025)
Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs
by: Yang, Dayu, et al.
Published: (2025)
by: Yang, Dayu, et al.
Published: (2025)
CodeMind: Evaluating Large Language Models for Code Reasoning
by: Liu, Changshu, et al.
Published: (2024)
by: Liu, Changshu, et al.
Published: (2024)
UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts
by: Yang, Bo, et al.
Published: (2024)
by: Yang, Bo, et al.
Published: (2024)
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding
by: Xu, Zhangchen, et al.
Published: (2025)
by: Xu, Zhangchen, et al.
Published: (2025)
To Code or not to Code? Adaptive Tool Integration for Math Language Models via Expectation-Maximization
by: Wang, Haozhe, et al.
Published: (2025)
by: Wang, Haozhe, et al.
Published: (2025)
Reinforced Efficient Reasoning via Semantically Diverse Exploration
by: Zhao, Ziqi, et al.
Published: (2026)
by: Zhao, Ziqi, et al.
Published: (2026)
CRScore: Grounding Automated Evaluation of Code Review Comments in Code Claims and Smells
by: Naik, Atharva, et al.
Published: (2024)
by: Naik, Atharva, et al.
Published: (2024)
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
by: Gehring, Jonas, et al.
Published: (2024)
by: Gehring, Jonas, et al.
Published: (2024)
OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding
by: Ding, Deming, et al.
Published: (2026)
by: Ding, Deming, et al.
Published: (2026)
Towards Effective Code-Integrated Reasoning
by: Bai, Fei, et al.
Published: (2025)
by: Bai, Fei, et al.
Published: (2025)
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records
by: Shi, Wenqi, et al.
Published: (2024)
by: Shi, Wenqi, et al.
Published: (2024)
ReCode: Reinforcing Code Generation with Reasoning-Process Rewards
by: Fan, Lishui, et al.
Published: (2025)
by: Fan, Lishui, et al.
Published: (2025)
What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code
by: Zhao, Yuze, et al.
Published: (2026)
by: Zhao, Yuze, et al.
Published: (2026)
SciCode: A Research Coding Benchmark Curated by Scientists
by: Tian, Minyang, et al.
Published: (2024)
by: Tian, Minyang, et al.
Published: (2024)
SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration
by: Zhang, Xichen, et al.
Published: (2025)
by: Zhang, Xichen, et al.
Published: (2025)
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation
by: Wan, Yixin, et al.
Published: (2023)
by: Wan, Yixin, et al.
Published: (2023)
CATArena: Evaluating Evolutionary Capabilities of Code Agents via Iterative Tournaments
by: Fu, Lingyue, et al.
Published: (2025)
by: Fu, Lingyue, et al.
Published: (2025)
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning
by: Yin, Shuo, et al.
Published: (2024)
by: Yin, Shuo, et al.
Published: (2024)
Coding Agents are Effective Long-Context Processors
by: Cao, Weili, et al.
Published: (2026)
by: Cao, Weili, et al.
Published: (2026)
SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation
by: Xu, Bin, et al.
Published: (2024)
by: Xu, Bin, et al.
Published: (2024)
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning
by: Chen, Yang, et al.
Published: (2025)
by: Chen, Yang, et al.
Published: (2025)
Coding Triangle: How Does Large Language Model Understand Code?
by: Zhang, Taolin, et al.
Published: (2025)
by: Zhang, Taolin, et al.
Published: (2025)
Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs
by: Zhu, Fengbin, et al.
Published: (2023)
by: Zhu, Fengbin, et al.
Published: (2023)
Similar Items
-
Learning to Plan with Personalized Preferences
by: Xu, Manjie, et al.
Published: (2025) -
Code Execution as Grounded Supervision for LLM Reasoning
by: Jung, Dongwon, et al.
Published: (2025) -
Boosting Chart-to-Code Generation in MLLM via Dual Preference-Guided Refinement
by: Zhang, Zhihan, et al.
Published: (2025) -
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
by: Li, Junlong, et al.
Published: (2025) -
Heterogeneous Adversarial Play in Interactive Environments
by: Xu, Manjie, et al.
Published: (2025)