Saved in:
| Main Authors: | Liu, Xin, Wang, Lu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.02536 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models
by: Min, Dehai, et al.
Published: (2026)
by: Min, Dehai, et al.
Published: (2026)
Stop When Enough: Adaptive Early-Stopping for Chain-of-Thought Reasoning
by: Sun, Renliang, et al.
Published: (2025)
by: Sun, Renliang, et al.
Published: (2025)
TRACES: Tagging Reasoning Steps for Adaptive Cost-Efficient Early-Stopping
by: Belkhiter, Yannis, et al.
Published: (2026)
by: Belkhiter, Yannis, et al.
Published: (2026)
Adaptive Stopping for Multi-Turn LLM Reasoning
by: Zhou, Xiaofan, et al.
Published: (2026)
by: Zhou, Xiaofan, et al.
Published: (2026)
Early Stopping for Large Reasoning Models via Confidence Dynamics
by: Hosseini, Parsa, et al.
Published: (2026)
by: Hosseini, Parsa, et al.
Published: (2026)
Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models
by: Wang, Changyue, et al.
Published: (2025)
by: Wang, Changyue, et al.
Published: (2025)
Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers
by: Xin, Rihui, et al.
Published: (2025)
by: Xin, Rihui, et al.
Published: (2025)
Sandwich Reasoning: An Answer-Reasoning-Answer Approach for Low-Latency Query Correction
by: Zhang, Chen, et al.
Published: (2026)
by: Zhang, Chen, et al.
Published: (2026)
TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning
by: Nagle, Alliot, et al.
Published: (2026)
by: Nagle, Alliot, et al.
Published: (2026)
Early Stopping Chain-of-thoughts in Large Language Models
by: Mao, Minjia, et al.
Published: (2025)
by: Mao, Minjia, et al.
Published: (2025)
Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability
by: Wang, Haotian, et al.
Published: (2025)
by: Wang, Haotian, et al.
Published: (2025)
Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding
by: Oba, Daisuke, et al.
Published: (2026)
by: Oba, Daisuke, et al.
Published: (2026)
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
by: Sui, Yang, et al.
Published: (2025)
by: Sui, Yang, et al.
Published: (2025)
From Answers to Rationales: Self-Aligning Multimodal Reasoning with Answer-Oriented Chain-of-Thought
by: Tan, Wentao, et al.
Published: (2025)
by: Tan, Wentao, et al.
Published: (2025)
Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models
by: Wang, Yuhui, et al.
Published: (2025)
by: Wang, Yuhui, et al.
Published: (2025)
Knowing When to Stop: Efficient Context Processing via Latent Sufficiency Signals
by: Xie, Roy, et al.
Published: (2025)
by: Xie, Roy, et al.
Published: (2025)
Think Through Uncertainty: Improving Long-Form Generation Factuality via Reasoning Calibration
by: Liu, Xin, et al.
Published: (2026)
by: Liu, Xin, et al.
Published: (2026)
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
by: Chen, Ding, et al.
Published: (2025)
by: Chen, Ding, et al.
Published: (2025)
Stop Fixating on Prompts: Reasoning Hijacking and Constraint Tightening for Red-Teaming LLM Agents
by: Mao, Yanxu, et al.
Published: (2026)
by: Mao, Yanxu, et al.
Published: (2026)
Answer-Centric or Reasoning-Driven? Uncovering the Latent Memory Anchor in LLMs
by: Wu, Yang, et al.
Published: (2025)
by: Wu, Yang, et al.
Published: (2025)
CGES: Confidence-Guided Early Stopping for Efficient and Accurate Self-Consistency
by: Aghazadeh, Ehsan, et al.
Published: (2025)
by: Aghazadeh, Ehsan, et al.
Published: (2025)
Think in Parallel, Answer as One: Logit Averaging for Open-Ended Reasoning
by: Wang, Haonan, et al.
Published: (2025)
by: Wang, Haonan, et al.
Published: (2025)
Stop Before You Fail: Operational Capability Boundaries for Mitigating Unproductive Reasoning in Large Reasoning Models
by: Zhang, Qingjie, et al.
Published: (2025)
by: Zhang, Qingjie, et al.
Published: (2025)
FLOP-Efficient Training: Early Stopping Based on Test-Time Compute Awareness
by: Amer, Hossam, et al.
Published: (2026)
by: Amer, Hossam, et al.
Published: (2026)
On the Robustness of Answer Formats in Medical Reasoning Models
by: Taveekitworachai, Pittawat, et al.
Published: (2025)
by: Taveekitworachai, Pittawat, et al.
Published: (2025)
Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models
by: Nie, Shuo, et al.
Published: (2026)
by: Nie, Shuo, et al.
Published: (2026)
Your Models Have Thought Enough: Training Large Reasoning Models to Stop Overthinking
by: Han, Jinyi, et al.
Published: (2025)
by: Han, Jinyi, et al.
Published: (2025)
Logit-Entropy Adaptive Stopping Heuristic for Efficient Chain-of-Thought Reasoning
by: Quamar, Mohammad Atif, et al.
Published: (2025)
by: Quamar, Mohammad Atif, et al.
Published: (2025)
Beg to Differ: Understanding Reasoning-Answer Misalignment Across Languages
by: Ovalle, Anaelia, et al.
Published: (2025)
by: Ovalle, Anaelia, et al.
Published: (2025)
Refining Answer Distributions for Improved Large Language Model Reasoning
by: Pal, Soumyasundar, et al.
Published: (2024)
by: Pal, Soumyasundar, et al.
Published: (2024)
DARL: Encouraging Diverse Answers for General Reasoning without Verifiers
by: Huang, Chongxuan, et al.
Published: (2026)
by: Huang, Chongxuan, et al.
Published: (2026)
Just on Time: Token-Level Early Stopping for Diffusion Language Models
by: Kohut, Zahar, et al.
Published: (2026)
by: Kohut, Zahar, et al.
Published: (2026)
Exploiting All Samples in Low-Resource Sentence Classification: Early Stopping and Initialization Parameters
by: Choi, Hongseok, et al.
Published: (2021)
by: Choi, Hongseok, et al.
Published: (2021)
Knowing When Not to Answer: Abstention-Aware Scientific Reasoning
by: Abdaljalil, Samir, et al.
Published: (2026)
by: Abdaljalil, Samir, et al.
Published: (2026)
Think Straight, Stop Smart: Structured Reasoning for Efficient Multi-Hop RAG
by: Bang, Jihwan, et al.
Published: (2025)
by: Bang, Jihwan, et al.
Published: (2025)
Clarify or Answer: Reinforcement Learning for Agentic VQA with Context Under-specification
by: Cao, Zongwan, et al.
Published: (2026)
by: Cao, Zongwan, et al.
Published: (2026)
How Long Reasoning Chains Influence LLMs' Judgment of Answer Factuality
by: Tu, Minzhu, et al.
Published: (2026)
by: Tu, Minzhu, et al.
Published: (2026)
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning
by: Zhang, Zhihan, et al.
Published: (2024)
by: Zhang, Zhihan, et al.
Published: (2024)
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning
by: Sheng, Leheng, et al.
Published: (2026)
by: Sheng, Leheng, et al.
Published: (2026)
From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring
by: Li, Yang, et al.
Published: (2025)
by: Li, Yang, et al.
Published: (2025)
Similar Items
-
Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models
by: Min, Dehai, et al.
Published: (2026) -
Stop When Enough: Adaptive Early-Stopping for Chain-of-Thought Reasoning
by: Sun, Renliang, et al.
Published: (2025) -
TRACES: Tagging Reasoning Steps for Adaptive Cost-Efficient Early-Stopping
by: Belkhiter, Yannis, et al.
Published: (2026) -
Adaptive Stopping for Multi-Turn LLM Reasoning
by: Zhou, Xiaofan, et al.
Published: (2026) -
Early Stopping for Large Reasoning Models via Confidence Dynamics
by: Hosseini, Parsa, et al.
Published: (2026)