:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pushkin, Denys, Abbe, Emmanuel
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.06870
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
by: Pushkin, Denys, et al.
Published: (2024)

Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning
by: Mahrooghi, Ilia, et al.
Published: (2026)

AbstRaL: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking
by: Gao, Silin, et al.
Published: (2025)

How Far Can Transformers Reason? The Globality Barrier and Inductive Scratchpad
by: Abbe, Emmanuel, et al.
Published: (2024)

Breaking the Context Bottleneck on Long Time Series Forecasting
by: Ma, Chao, et al.
Published: (2024)

$k$-server-bench: Automating Potential Discovery for the $k$-Server Conjecture
by: Brilliantov, Kirill, et al.
Published: (2026)

RL for Reasoning by Adaptively Revealing Rationales
by: Amani, Mohammad Hossein, et al.
Published: (2025)

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
by: Zhou, Yang, et al.
Published: (2025)

LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models
by: Wei, Songtao, et al.
Published: (2026)

The Long-Horizon Task Mirage? Diagnosing Where and Why Agentic Systems Break
by: Wang, Xinyu Jessica, et al.
Published: (2026)

The Cognitive Bandwidth Bottleneck: Shifting Long-Horizon Agent from Planning with Actions to Planning with Schemas
by: Xu, Baixuan, et al.
Published: (2025)

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks
by: Wang, Tianyi, et al.
Published: (2026)

SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent
by: Hu, Yuyang, et al.
Published: (2026)

The Illusion of Procedural Reasoning: Measuring Long-Horizon FSM Execution in LLMs
by: Samiei, Mahdi, et al.
Published: (2025)

COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context
by: Wan, Guangya, et al.
Published: (2025)

MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning
by: Liu, Yuxin, et al.
Published: (2026)

Intrinsic Stability Limits of Autoregressive Reasoning: Structural Consequences for Long-Horizon Execution
by: Liao, Hsien-Jyh
Published: (2026)

SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models
by: Monti, Sebastiano, et al.
Published: (2026)

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
by: Shi, Yaorui, et al.
Published: (2026)

Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression
by: Saegert, Paul, et al.
Published: (2026)

Revisiting LLM Reasoning via Information Bottleneck
by: Lei, Shiye, et al.
Published: (2025)

From Differentiation to Cognition: UTD as a Model of Recursive Awareness
by: Spirin, Denys
Published: (2025)

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
by: Wang, Tianle, et al.
Published: (2026)

Prune-OPD: Efficient and Reliable On-Policy Distillation for Long-Horizon Reasoning
by: Yang, Zhicheng, et al.
Published: (2026)

When to Re-Commit: Temporal Abstraction Discovery for Long-Horizon Vision-Language Reasoning
by: Li, Chen, et al.
Published: (2026)

HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds
by: Anokhin, Petr, et al.
Published: (2025)

Towards Reasonable Concept Bottleneck Models
by: Kalampalikis, Nektarios, et al.
Published: (2025)

RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation
by: Cheng, Zelei, et al.
Published: (2024)

ReFlect: An Effective Harness System for Complex Long-Horizon LLM Reasoning
by: Huang, Fan
Published: (2026)

AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning
by: Hu, Yuyang, et al.
Published: (2026)

Harmonizing Real-Time Constraints and Long-Horizon Reasoning: An Asynchronous Agentic Framework for Dynamic Scheduling
by: Cao, Shijie, et al.
Published: (2026)

CoreThink: A Symbolic Reasoning Layer to reason over Long Horizon Tasks with LLMs
by: Vaghasiya, Jay, et al.
Published: (2025)

SAG-Agent: Enabling Long-Horizon Reasoning in Strategy Games via Dynamic Knowledge Graphs
by: Tang, Chenwei, et al.
Published: (2025)

LEAD: Latent Realignment for Human Motion Diffusion
by: Andreou, Nefeli, et al.
Published: (2024)

HorizonBench: Long-Horizon Personalization with Evolving Preferences
by: Li, Shuyue Stella, et al.
Published: (2026)

When can transformers reason with abstract symbols?
by: Boix-Adsera, Enric, et al.
Published: (2023)

Dynamic-TreeRPO: Breaking the Independent Trajectory Bottleneck with Structured Sampling
by: Fu, Xiaolong, et al.
Published: (2025)

LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning
by: Lin, Xiaotian, et al.
Published: (2025)

EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning
by: Yu, Chengjun, et al.
Published: (2026)