:: Library Catalog

Bibliographic Details
Main Author:	Liao, Hsien-Jyh
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.06413

Similar Items

Enforcing Monotonic Progress in Legal Cross-Examination: Preventing Long-Horizon Stagnation in LLM-Based Inquiry
by: Liao, Hsien-Jyh
Published: (2026)

The Illusion of Procedural Reasoning: Measuring Long-Horizon FSM Execution in LLMs
by: Samiei, Mahdi, et al.
Published: (2025)

Intrinsic Credit Assignment for Long Horizon Interaction
by: Auzina, Ilze Amanda, et al.
Published: (2026)

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
by: Sinha, Akshit, et al.
Published: (2025)

HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds
by: Anokhin, Petr, et al.
Published: (2025)

SAGE: Scene Graph-Aware Guidance and Execution for Long-Horizon Manipulation Tasks
by: Li, Jialiang, et al.
Published: (2025)

PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks
by: Orimo, Yuki, et al.
Published: (2025)

LEAD: Breaking the No-Recovery Bottleneck in Long-Horizon Reasoning
by: Pushkin, Denys, et al.
Published: (2026)

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
by: Hu, Xavier, et al.
Published: (2026)

Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning
by: Hwang, Jaebak, et al.
Published: (2025)

When Robots Do the Chores: A Benchmark and Agent for Long-Horizon Household Task Execution
by: Zhu, Zilin, et al.
Published: (2026)

Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation
by: Deng, Zehao, et al.
Published: (2025)

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks
by: Wang, Tianyi, et al.
Published: (2026)

SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent
by: Hu, Yuyang, et al.
Published: (2026)

Long-Horizon Plan Execution in Large Tool Spaces through Entropy-Guided Branching
by: Wei, Rongzhe, et al.
Published: (2026)

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
by: Li, Junlong, et al.
Published: (2025)

Environment Maps: Structured Environmental Representations for Long-Horizon Agents
by: Feng, Yenchia, et al.
Published: (2026)

EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning
by: Hu, Chuanrui, et al.
Published: (2026)

COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context
by: Wan, Guangya, et al.
Published: (2025)

MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning
by: Liu, Yuxin, et al.
Published: (2026)

SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models
by: Monti, Sebastiano, et al.
Published: (2026)

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
by: Shi, Yaorui, et al.
Published: (2026)

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
by: Wang, Tianle, et al.
Published: (2026)

Prune-OPD: Efficient and Reliable On-Policy Distillation for Long-Horizon Reasoning
by: Yang, Zhicheng, et al.
Published: (2026)

When to Re-Commit: Temporal Abstraction Discovery for Long-Horizon Vision-Language Reasoning
by: Li, Chen, et al.
Published: (2026)

DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
by: Yan, Hao, et al.
Published: (2026)

MemPO: Self-Memory Policy Optimization for Long-Horizon Agents
by: Li, Ruoran, et al.
Published: (2026)

Dynamic Intelligence Ceilings: Measuring Long-Horizon Limits of Planning and Creativity in Artificial Systems
by: Khanh, Truong Xuan, et al.
Published: (2026)

Reasoning with Autoregressive-Diffusion Collaborative Thoughts
by: Yuan, Mu, et al.
Published: (2026)

Limits to AI Growth: The Ecological and Social Consequences of Scaling
by: Bhardwaj, Eshta, et al.
Published: (2025)

HINTBench: Horizon-agent Intrinsic Non-attack Trajectory Benchmark
by: Wang, Jiacheng, et al.
Published: (2026)

ReFlect: An Effective Harness System for Complex Long-Horizon LLM Reasoning
by: Huang, Fan
Published: (2026)

AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning
by: Hu, Yuyang, et al.
Published: (2026)

Harmonizing Real-Time Constraints and Long-Horizon Reasoning: An Asynchronous Agentic Framework for Dynamic Scheduling
by: Cao, Shijie, et al.
Published: (2026)

CoreThink: A Symbolic Reasoning Layer to reason over Long Horizon Tasks with LLMs
by: Vaghasiya, Jay, et al.
Published: (2025)

SAG-Agent: Enabling Long-Horizon Reasoning in Strategy Games via Dynamic Knowledge Graphs
by: Tang, Chenwei, et al.
Published: (2025)

HorizonBench: Long-Horizon Personalization with Evolving Preferences
by: Li, Shuyue Stella, et al.
Published: (2026)

$\boldsymbol{f}$-OPD: Stabilizing Long-Horizon On-Policy Distillation with Freshness-Aware Control
by: Chen, Xianwei, et al.
Published: (2026)

On the Intrinsic Limits of Transformer Image Embeddings in Non-Solvable Spatial Reasoning
by: Lyu, Siyi, et al.
Published: (2026)