Library Catalog

Bibliographic Details
Main Authors:	Xiao, Changnan; Liu, Bing
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2404.00560

Similar Items

Open-World Continual Learning: Unifying Novelty Detection and Continual Learning
by: Kim, Gyuhak, et al.
Published: (2023)

PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models
by: Xu, Yinggan, et al.
Published: (2025)

SmartThinker: Learning to Compress and Preserve Reasoning by Step-Level Length Control
by: He, Xingyang, et al.
Published: (2025)

On the Step Length Confounding in LLM Reasoning Data Selection
by: Wang, Bing, et al.
Published: (2026)

AnaCP: Toward Upper-Bound Continual Learning via Analytic Contrastive Projection
by: Momeni, Saleh, et al.
Published: (2025)

Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization
by: Huang, Yu, et al.
Published: (2025)

Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance
by: Xiong, Kai, et al.
Published: (2024)

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
by: Liu, Wei, et al.
Published: (2025)

More Thinking, More Bias: Length-Driven Position Bias in Reasoning Models
by: Wang, Xiao
Published: (2026)

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning
by: Li, Xintong, et al.
Published: (2026)

An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
by: Sun, Yi, et al.
Published: (2025)

DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning
by: He, Yang, et al.
Published: (2026)

Relational Learning in Pre-Trained Models: A Theory from Hypergraph Recovery Perspective
by: Chen, Yang, et al.
Published: (2024)

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
by: Wu, Xingyu, et al.
Published: (2025)

ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning
by: Yi, Jingyang, et al.
Published: (2025)

Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
by: Xiang, Violet, et al.
Published: (2025)

Optimizing Length Compression in Large Reasoning Models
by: Cheng, Zhengxiang, et al.
Published: (2025)

Trace Length is a Simple Uncertainty Signal in Reasoning Models
by: Devic, Siddartha, et al.
Published: (2025)

Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models
by: Wu, Wei, et al.
Published: (2026)

Learning Dynamic Belief Graphs for Theory-of-mind Reasoning
by: Chen, Ruxiao, et al.
Published: (2026)

GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning
by: Han, Xiao, et al.
Published: (2026)

Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length?
by: Lee, Celine, et al.
Published: (2025)

The Impact of Reasoning Step Length on Large Language Models
by: Jin, Mingyu, et al.
Published: (2024)

GR-Ben: A General Reasoning Benchmark for Evaluating Process Reward Models
by: Sun, Zhouhao, et al.
Published: (2026)

Boule or Baguette? A Study on Task Topology, Length Generalization, and the Benefit of Reasoning Traces
by: Tong, William L., et al.
Published: (2026)

MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind
by: Zhang, Zheng, et al.
Published: (2025)

Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count
by: Cho, Hanseul, et al.
Published: (2024)

Leash: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model
by: Li, Yanhao, et al.
Published: (2025)

ReasonGRM: Enhancing Generative Reward Models through Large Reasoning Models
by: Chen, Bin, et al.
Published: (2025)

First Try Matters: Revisiting the Role of Reflection in Reasoning Models
by: Kang, Liwei, et al.
Published: (2025)

Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost
by: Nayab, Sania, et al.
Published: (2024)

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models
by: Zhang, Qiyuan, et al.
Published: (2026)

InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models
by: Yan, Yuchen, et al.
Published: (2025)

Monitor-Generate-Verify (MGV): Formalising Metacognitive Theory for Language Model Reasoning
by: Oh, Nick, et al.
Published: (2025)

Protein-Conditioned Multi-Objective Reinforcement Learning for Full-Length mRNA Design
by: Shao, Zixi, et al.
Published: (2026)

Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning
by: Hu, Boren, et al.
Published: (2026)

On Vanishing Variance in Transformer Length Generalization
by: Li, Ruining, et al.
Published: (2025)

Mamba Modulation: On the Length Generalization of Mamba
by: Lu, Peng, et al.
Published: (2025)

Prompt-Based Length Controlled Generation with Multiple Control Types
by: Jie, Renlong, et al.
Published: (2024)

On the Optimal Reasoning Length for RL-Trained Language Models
by: Nohara, Daisuke, et al.
Published: (2026)