Saved in:
| Main Author: | Liao, Hsien-Jyh |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.06413 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Enforcing Monotonic Progress in Legal Cross-Examination: Preventing Long-Horizon Stagnation in LLM-Based Inquiry
by: Liao, Hsien-Jyh
Published: (2026)
by: Liao, Hsien-Jyh
Published: (2026)
The Illusion of Procedural Reasoning: Measuring Long-Horizon FSM Execution in LLMs
by: Samiei, Mahdi, et al.
Published: (2025)
by: Samiei, Mahdi, et al.
Published: (2025)
Intrinsic Credit Assignment for Long Horizon Interaction
by: Auzina, Ilze Amanda, et al.
Published: (2026)
by: Auzina, Ilze Amanda, et al.
Published: (2026)
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
by: Sinha, Akshit, et al.
Published: (2025)
by: Sinha, Akshit, et al.
Published: (2025)
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds
by: Anokhin, Petr, et al.
Published: (2025)
by: Anokhin, Petr, et al.
Published: (2025)
SAGE: Scene Graph-Aware Guidance and Execution for Long-Horizon Manipulation Tasks
by: Li, Jialiang, et al.
Published: (2025)
by: Li, Jialiang, et al.
Published: (2025)
PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks
by: Orimo, Yuki, et al.
Published: (2025)
by: Orimo, Yuki, et al.
Published: (2025)
LEAD: Breaking the No-Recovery Bottleneck in Long-Horizon Reasoning
by: Pushkin, Denys, et al.
Published: (2026)
by: Pushkin, Denys, et al.
Published: (2026)
EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
by: Hu, Xavier, et al.
Published: (2026)
by: Hu, Xavier, et al.
Published: (2026)
Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning
by: Hwang, Jaebak, et al.
Published: (2025)
by: Hwang, Jaebak, et al.
Published: (2025)
When Robots Do the Chores: A Benchmark and Agent for Long-Horizon Household Task Execution
by: Zhu, Zilin, et al.
Published: (2026)
by: Zhu, Zilin, et al.
Published: (2026)
Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation
by: Deng, Zehao, et al.
Published: (2025)
by: Deng, Zehao, et al.
Published: (2025)
LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)
SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks
by: Wang, Tianyi, et al.
Published: (2026)
by: Wang, Tianyi, et al.
Published: (2026)
SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent
by: Hu, Yuyang, et al.
Published: (2026)
by: Hu, Yuyang, et al.
Published: (2026)
Long-Horizon Plan Execution in Large Tool Spaces through Entropy-Guided Branching
by: Wei, Rongzhe, et al.
Published: (2026)
by: Wei, Rongzhe, et al.
Published: (2026)
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
by: Li, Junlong, et al.
Published: (2025)
by: Li, Junlong, et al.
Published: (2025)
Environment Maps: Structured Environmental Representations for Long-Horizon Agents
by: Feng, Yenchia, et al.
Published: (2026)
by: Feng, Yenchia, et al.
Published: (2026)
EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning
by: Hu, Chuanrui, et al.
Published: (2026)
by: Hu, Chuanrui, et al.
Published: (2026)
COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context
by: Wan, Guangya, et al.
Published: (2025)
by: Wan, Guangya, et al.
Published: (2025)
MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning
by: Liu, Yuxin, et al.
Published: (2026)
by: Liu, Yuxin, et al.
Published: (2026)
SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models
by: Monti, Sebastiano, et al.
Published: (2026)
by: Monti, Sebastiano, et al.
Published: (2026)
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
by: Shi, Yaorui, et al.
Published: (2026)
by: Shi, Yaorui, et al.
Published: (2026)
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
by: Wang, Tianle, et al.
Published: (2026)
by: Wang, Tianle, et al.
Published: (2026)
Prune-OPD: Efficient and Reliable On-Policy Distillation for Long-Horizon Reasoning
by: Yang, Zhicheng, et al.
Published: (2026)
by: Yang, Zhicheng, et al.
Published: (2026)
When to Re-Commit: Temporal Abstraction Discovery for Long-Horizon Vision-Language Reasoning
by: Li, Chen, et al.
Published: (2026)
by: Li, Chen, et al.
Published: (2026)
DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
by: Yan, Hao, et al.
Published: (2026)
by: Yan, Hao, et al.
Published: (2026)
MemPO: Self-Memory Policy Optimization for Long-Horizon Agents
by: Li, Ruoran, et al.
Published: (2026)
by: Li, Ruoran, et al.
Published: (2026)
Dynamic Intelligence Ceilings: Measuring Long-Horizon Limits of Planning and Creativity in Artificial Systems
by: Khanh, Truong Xuan, et al.
Published: (2026)
by: Khanh, Truong Xuan, et al.
Published: (2026)
Reasoning with Autoregressive-Diffusion Collaborative Thoughts
by: Yuan, Mu, et al.
Published: (2026)
by: Yuan, Mu, et al.
Published: (2026)
Limits to AI Growth: The Ecological and Social Consequences of Scaling
by: Bhardwaj, Eshta, et al.
Published: (2025)
by: Bhardwaj, Eshta, et al.
Published: (2025)
HINTBench: Horizon-agent Intrinsic Non-attack Trajectory Benchmark
by: Wang, Jiacheng, et al.
Published: (2026)
by: Wang, Jiacheng, et al.
Published: (2026)
ReFlect: An Effective Harness System for Complex Long-Horizon LLM Reasoning
by: Huang, Fan
Published: (2026)
by: Huang, Fan
Published: (2026)
AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning
by: Hu, Yuyang, et al.
Published: (2026)
by: Hu, Yuyang, et al.
Published: (2026)
Harmonizing Real-Time Constraints and Long-Horizon Reasoning: An Asynchronous Agentic Framework for Dynamic Scheduling
by: Cao, Shijie, et al.
Published: (2026)
by: Cao, Shijie, et al.
Published: (2026)
CoreThink: A Symbolic Reasoning Layer to reason over Long Horizon Tasks with LLMs
by: Vaghasiya, Jay, et al.
Published: (2025)
by: Vaghasiya, Jay, et al.
Published: (2025)
SAG-Agent: Enabling Long-Horizon Reasoning in Strategy Games via Dynamic Knowledge Graphs
by: Tang, Chenwei, et al.
Published: (2025)
by: Tang, Chenwei, et al.
Published: (2025)
HorizonBench: Long-Horizon Personalization with Evolving Preferences
by: Li, Shuyue Stella, et al.
Published: (2026)
by: Li, Shuyue Stella, et al.
Published: (2026)
$\boldsymbol{f}$-OPD: Stabilizing Long-Horizon On-Policy Distillation with Freshness-Aware Control
by: Chen, Xianwei, et al.
Published: (2026)
by: Chen, Xianwei, et al.
Published: (2026)
On the Intrinsic Limits of Transformer Image Embeddings in Non-Solvable Spatial Reasoning
by: Lyu, Siyi, et al.
Published: (2026)
by: Lyu, Siyi, et al.
Published: (2026)
Similar Items
-
Enforcing Monotonic Progress in Legal Cross-Examination: Preventing Long-Horizon Stagnation in LLM-Based Inquiry
by: Liao, Hsien-Jyh
Published: (2026) -
The Illusion of Procedural Reasoning: Measuring Long-Horizon FSM Execution in LLMs
by: Samiei, Mahdi, et al.
Published: (2025) -
Intrinsic Credit Assignment for Long Horizon Interaction
by: Auzina, Ilze Amanda, et al.
Published: (2026) -
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
by: Sinha, Akshit, et al.
Published: (2025) -
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds
by: Anokhin, Petr, et al.
Published: (2025)