Saved in:
| Main Authors: | Yi, Jingyang, Wang, Jiazheng, Li, Sida |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.21370 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR
by: Bounhar, Abdelaziz, et al.
Published: (2025)
by: Bounhar, Abdelaziz, et al.
Published: (2025)
LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models
by: Wei, Songtao, et al.
Published: (2026)
by: Wei, Songtao, et al.
Published: (2026)
Implicit Compression Regularization: Concise Reasoning via Internal Shorter Distributions in RL Post-Training
by: Wang, Chen, et al.
Published: (2026)
by: Wang, Chen, et al.
Published: (2026)
Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models
by: Wu, Wei, et al.
Published: (2026)
by: Wu, Wei, et al.
Published: (2026)
On the Optimal Reasoning Length for RL-Trained Language Models
by: Nohara, Daisuke, et al.
Published: (2026)
by: Nohara, Daisuke, et al.
Published: (2026)
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning
by: Hassid, Michael, et al.
Published: (2025)
by: Hassid, Michael, et al.
Published: (2025)
Leash: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model
by: Li, Yanhao, et al.
Published: (2025)
by: Li, Yanhao, et al.
Published: (2025)
Towards Interpretable and Inference-Optimal COT Reasoning with Sparse Autoencoder-Guided Generation
by: Zhao, Daniel, et al.
Published: (2025)
by: Zhao, Daniel, et al.
Published: (2025)
Boosting Inference with Guided Reasoning: Stochastic Exploration for Recursive Models
by: Corbett, Andrew, et al.
Published: (2026)
by: Corbett, Andrew, et al.
Published: (2026)
Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length?
by: Lee, Celine, et al.
Published: (2025)
by: Lee, Celine, et al.
Published: (2025)
Pseudocode-Guided Structured Reasoning for Automating Reliable Inference in Vision-Language Models
by: Ni, Weicong, et al.
Published: (2026)
by: Ni, Weicong, et al.
Published: (2026)
Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning
by: Li, Xintong, et al.
Published: (2026)
by: Li, Xintong, et al.
Published: (2026)
Learning to Self-Verify Makes Language Models Better Reasoners
by: Chen, Yuxin, et al.
Published: (2026)
by: Chen, Yuxin, et al.
Published: (2026)
Mixed Distillation Helps Smaller Language Model Better Reasoning
by: Li, Chenglin, et al.
Published: (2023)
by: Li, Chenglin, et al.
Published: (2023)
Reasoning Models Better Express Their Confidence
by: Yoon, Dongkeun, et al.
Published: (2025)
by: Yoon, Dongkeun, et al.
Published: (2025)
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models
by: Lian, Long, et al.
Published: (2025)
by: Lian, Long, et al.
Published: (2025)
An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
by: Sun, Yi, et al.
Published: (2025)
by: Sun, Yi, et al.
Published: (2025)
CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models in Mathematical Reasoning
by: Zheng, Congmin, et al.
Published: (2025)
by: Zheng, Congmin, et al.
Published: (2025)
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
by: Sanyal, Soumya, et al.
Published: (2024)
by: Sanyal, Soumya, et al.
Published: (2024)
Entropy-Guided Data-Efficient Training for Multimodal Reasoning Reward Models
by: Yang, Shidong, et al.
Published: (2026)
by: Yang, Shidong, et al.
Published: (2026)
Timo: Towards Better Temporal Reasoning for Language Models
by: Su, Zhaochen, et al.
Published: (2024)
by: Su, Zhaochen, et al.
Published: (2024)
Optimizing Length Compression in Large Reasoning Models
by: Cheng, Zhengxiang, et al.
Published: (2025)
by: Cheng, Zhengxiang, et al.
Published: (2025)
EntroCut: Entropy-Guided Adaptive Truncation for Efficient Chain-of-Thought Reasoning in Small-scale Large Reasoning Models
by: Yan, Hongxi, et al.
Published: (2026)
by: Yan, Hongxi, et al.
Published: (2026)
Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition
by: Zeng, Zihao, et al.
Published: (2025)
by: Zeng, Zihao, et al.
Published: (2025)
Beyond Token Length: Step Pruner for Efficient and Accurate Reasoning in Large Language Models
by: Wu, Canhui, et al.
Published: (2025)
by: Wu, Canhui, et al.
Published: (2025)
Predictive Scheduling for Efficient Inference-Time Reasoning in Large Language Models
by: Brown, Katrina, et al.
Published: (2026)
by: Brown, Katrina, et al.
Published: (2026)
ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference
by: Wang, Junda, et al.
Published: (2026)
by: Wang, Junda, et al.
Published: (2026)
Trace Length is a Simple Uncertainty Signal in Reasoning Models
by: Devic, Siddartha, et al.
Published: (2025)
by: Devic, Siddartha, et al.
Published: (2025)
QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation
by: Li, Jiazheng, et al.
Published: (2025)
by: Li, Jiazheng, et al.
Published: (2025)
From Table to Cell: Attention for Better Reasoning with TABALIGN
by: Kwok, Tung Sum Thomas, et al.
Published: (2026)
by: Kwok, Tung Sum Thomas, et al.
Published: (2026)
Experience-Guided Adaptation of Inference-Time Reasoning Strategies
by: Stein, Adam, et al.
Published: (2025)
by: Stein, Adam, et al.
Published: (2025)
Efficient Reasoning via Reward Model
by: Wang, Yuhao, et al.
Published: (2025)
by: Wang, Yuhao, et al.
Published: (2025)
Searching Meta Reasoning Skeleton to Guide LLM Reasoning
by: Zhang, Ziying, et al.
Published: (2025)
by: Zhang, Ziying, et al.
Published: (2025)
CausalEval: Towards Better Causal Reasoning in Language Models
by: Yu, Longxuan, et al.
Published: (2024)
by: Yu, Longxuan, et al.
Published: (2024)
Hawkeye:Efficient Reasoning with Model Collaboration
by: She, Jianshu, et al.
Published: (2025)
by: She, Jianshu, et al.
Published: (2025)
The Impact of Reasoning Step Length on Large Language Models
by: Jin, Mingyu, et al.
Published: (2024)
by: Jin, Mingyu, et al.
Published: (2024)
Abstraction-of-Thought Makes Language Models Better Reasoners
by: Hong, Ruixin, et al.
Published: (2024)
by: Hong, Ruixin, et al.
Published: (2024)
A Theory for Length Generalization in Learning to Reason
by: Xiao, Changnan, et al.
Published: (2024)
by: Xiao, Changnan, et al.
Published: (2024)
Optimal Self-Consistency for Efficient Reasoning with Large Language Models
by: Feng, Austin, et al.
Published: (2025)
by: Feng, Austin, et al.
Published: (2025)
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
by: Wang, Rui, et al.
Published: (2025)
by: Wang, Rui, et al.
Published: (2025)
Similar Items
-
Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR
by: Bounhar, Abdelaziz, et al.
Published: (2025) -
LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models
by: Wei, Songtao, et al.
Published: (2026) -
Implicit Compression Regularization: Concise Reasoning via Internal Shorter Distributions in RL Post-Training
by: Wang, Chen, et al.
Published: (2026) -
Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models
by: Wu, Wei, et al.
Published: (2026) -
On the Optimal Reasoning Length for RL-Trained Language Models
by: Nohara, Daisuke, et al.
Published: (2026)