| Main Authors: | Xiao, Changnan, Liu, Bing |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.00560 |
Similar Items
Open-World Continual Learning: Unifying Novelty Detection and Continual Learning
by: Kim, Gyuhak, et al.
Published: (2023)
PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models
by: Xu, Yinggan, et al.
Published: (2025)
SmartThinker: Learning to Compress and Preserve Reasoning by Step-Level Length Control
by: He, Xingyang, et al.
Published: (2025)
On the Step Length Confounding in LLM Reasoning Data Selection
by: Wang, Bing, et al.
Published: (2026)
AnaCP: Toward Upper-Bound Continual Learning via Analytic Contrastive Projection
by: Momeni, Saleh, et al.
Published: (2025)
Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization
by: Huang, Yu, et al.
Published: (2025)
Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance
by: Xiong, Kai, et al.
Published: (2024)
Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
by: Liu, Wei, et al.
Published: (2025)
More Thinking, More Bias: Length-Driven Position Bias in Reasoning Models
by: Wang, Xiao
Published: (2026)
Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning
by: Li, Xintong, et al.
Published: (2026)
An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
by: Sun, Yi, et al.
Published: (2025)
DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning
by: He, Yang, et al.
Published: (2026)
Relational Learning in Pre-Trained Models: A Theory from Hypergraph Recovery Perspective
by: Chen, Yang, et al.
Published: (2024)
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
by: Wu, Xingyu, et al.
Published: (2025)
ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning
by: Yi, Jingyang, et al.
Published: (2025)
Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
by: Xiang, Violet, et al.
Published: (2025)
Optimizing Length Compression in Large Reasoning Models
by: Cheng, Zhengxiang, et al.
Published: (2025)
Trace Length is a Simple Uncertainty Signal in Reasoning Models
by: Devic, Siddartha, et al.
Published: (2025)
Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models
by: Wu, Wei, et al.
Published: (2026)
Learning Dynamic Belief Graphs for Theory-of-mind Reasoning
by: Chen, Ruxiao, et al.
Published: (2026)
GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning
by: Han, Xiao, et al.
Published: (2026)
Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length?
by: Lee, Celine, et al.
Published: (2025)
The Impact of Reasoning Step Length on Large Language Models
by: Jin, Mingyu, et al.
Published: (2024)
GR-Ben: A General Reasoning Benchmark for Evaluating Process Reward Models
by: Sun, Zhouhao, et al.
Published: (2026)
Boule or Baguette? A Study on Task Topology, Length Generalization, and the Benefit of Reasoning Traces
by: Tong, William L., et al.
Published: (2026)
MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind
by: Zhang, Zheng, et al.
Published: (2025)
Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count
by: Cho, Hanseul, et al.
Published: (2024)
Leash: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model
by: Li, Yanhao, et al.
Published: (2025)
ReasonGRM: Enhancing Generative Reward Models through Large Reasoning Models
by: Chen, Bin, et al.
Published: (2025)
First Try Matters: Revisiting the Role of Reflection in Reasoning Models
by: Kang, Liwei, et al.
Published: (2025)
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost
by: Nayab, Sania, et al.
Published: (2024)
Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models
by: Zhang, Qiyuan, et al.
Published: (2026)
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models
by: Yan, Yuchen, et al.
Published: (2025)
Monitor-Generate-Verify (MGV): Formalising Metacognitive Theory for Language Model Reasoning
by: Oh, Nick, et al.
Published: (2025)
Protein-Conditioned Multi-Objective Reinforcement Learning for Full-Length mRNA Design
by: Shao, Zixi, et al.
Published: (2026)
Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning
by: Hu, Boren, et al.
Published: (2026)
On Vanishing Variance in Transformer Length Generalization
by: Li, Ruining, et al.
Published: (2025)
Mamba Modulation: On the Length Generalization of Mamba
by: Lu, Peng, et al.
Published: (2025)
Prompt-Based Length Controlled Generation with Multiple Control Types
by: Jie, Renlong, et al.
Published: (2024)
On the Optimal Reasoning Length for RL-Trained Language Models
by: Nohara, Daisuke, et al.
Published: (2026)