Saved in:
| Main Author: | Liao, Hsien-Jyh |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04206 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Intrinsic Stability Limits of Autoregressive Reasoning: Structural Consequences for Long-Horizon Execution
by: Liao, Hsien-Jyh
Published: (2026)
by: Liao, Hsien-Jyh
Published: (2026)
GPTs and Language Barrier: A Cross-Lingual Legal QA Examination
by: Nguyen, Ha-Thanh, et al.
Published: (2024)
by: Nguyen, Ha-Thanh, et al.
Published: (2024)
Causal Intervention-Based Memory Selection for Long-Horizon LLM Agents
by: Srivastava, Saksham Sahai
Published: (2026)
by: Srivastava, Saksham Sahai
Published: (2026)
Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model
by: Lin, Chun-Hsien, et al.
Published: (2024)
by: Lin, Chun-Hsien, et al.
Published: (2024)
Dynamic Long Short-Term Memory Based Memory Storage For Long Horizon LLM Interaction
by: Lou, Yuyang, et al.
Published: (2025)
by: Lou, Yuyang, et al.
Published: (2025)
Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations
by: Liu, Ziyang
Published: (2026)
by: Liu, Ziyang
Published: (2026)
Enforcing Temporal Constraints for LLM Agents
by: Kamath, Adharsh, et al.
Published: (2025)
by: Kamath, Adharsh, et al.
Published: (2025)
HorizonBench: Long-Horizon Personalization with Evolving Preferences
by: Li, Shuyue Stella, et al.
Published: (2026)
by: Li, Shuyue Stella, et al.
Published: (2026)
ReFlect: An Effective Harness System for Complex Long-Horizon LLM Reasoning
by: Huang, Fan
Published: (2026)
by: Huang, Fan
Published: (2026)
The Lock-in Hypothesis: Stagnation by Algorithm
by: Qiu, Tianyi Alex, et al.
Published: (2025)
by: Qiu, Tianyi Alex, et al.
Published: (2025)
LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation
by: Enguehard, Joseph, et al.
Published: (2025)
by: Enguehard, Joseph, et al.
Published: (2025)
Beyond Reactive Safety: Risk-Aware LLM Alignment via Long-Horizon Simulation
by: Sun, Chenkai, et al.
Published: (2025)
by: Sun, Chenkai, et al.
Published: (2025)
IterResearch: Rethinking Long-Horizon Agents with Interaction Scaling
by: Chen, Guoxin, et al.
Published: (2025)
by: Chen, Guoxin, et al.
Published: (2025)
UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios
by: Luo, Haotian, et al.
Published: (2025)
by: Luo, Haotian, et al.
Published: (2025)
Long-Context Long-Form Question Answering for Legal Domain
by: Kulkarni, Anagha, et al.
Published: (2026)
by: Kulkarni, Anagha, et al.
Published: (2026)
Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment
by: Bu, Yuyan, et al.
Published: (2026)
by: Bu, Yuyan, et al.
Published: (2026)
It's Not the Capability: Harness Sensitivity Is Non-Monotone Across LLM Agent Tiers
by: Cho, Yong-eun
Published: (2026)
by: Cho, Yong-eun
Published: (2026)
BenGER: Benchmarking LLM Systems on Subsumption-Based Legal Reasoning in German Law
by: Nagl, Sebastian, et al.
Published: (2026)
by: Nagl, Sebastian, et al.
Published: (2026)
Place Matters: Comparing LLM Hallucination Rates for Place-Based Legal Queries
by: Curran, Damian, et al.
Published: (2025)
by: Curran, Damian, et al.
Published: (2025)
The Text Uncanny Valley: Non-Monotonic Performance Degradation in LLM Information Retrieval
by: Tong, Zekai, et al.
Published: (2026)
by: Tong, Zekai, et al.
Published: (2026)
MAGE: Safeguarding LLM Agents against Long-Horizon Threats via Shadow Memory
by: Wang, Yuhui, et al.
Published: (2026)
by: Wang, Yuhui, et al.
Published: (2026)
Improving Legal Judgement Prediction in Romanian with Long Text Encoders
by: Masala, Mihai, et al.
Published: (2024)
by: Masala, Mihai, et al.
Published: (2024)
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2025)
by: Xi, Zhiheng, et al.
Published: (2025)
Wide-Horizon Thinking and Simulation-Based Evaluation for Real-World LLM Planning with Multifaceted Constraints
by: Yang, Dongjie, et al.
Published: (2025)
by: Yang, Dongjie, et al.
Published: (2025)
Evaluating Long-Horizon Memory for Multi-Party Collaborative Dialogues
by: Hu, Chuanrui, et al.
Published: (2026)
by: Hu, Chuanrui, et al.
Published: (2026)
Connecting the Dots: Benchmarking Reflective Memory in Long-Horizon Dialogue
by: Lin, Jingjie, et al.
Published: (2026)
by: Lin, Jingjie, et al.
Published: (2026)
GTA: Generating Long-Horizon Tasks for Web Agents at Scale
by: Huang, Tenghao, et al.
Published: (2026)
by: Huang, Tenghao, et al.
Published: (2026)
Milestone-Guided Policy Learning for Long-Horizon Language Agents
by: Wang, Zixuan, et al.
Published: (2026)
by: Wang, Zixuan, et al.
Published: (2026)
COCORELI: Enforcing Execution Preconditions for Reliable Collaborative Instruction Following
by: Bhar, Swarnadeep, et al.
Published: (2025)
by: Bhar, Swarnadeep, et al.
Published: (2025)
Law in Silico: Simulating Legal Society with LLM-Based Agents
by: Wang, Yiding, et al.
Published: (2025)
by: Wang, Yiding, et al.
Published: (2025)
COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context
by: Wan, Guangya, et al.
Published: (2025)
by: Wan, Guangya, et al.
Published: (2025)
Temporal Preferences in Language Models for Long-Horizon Assistance
by: Mazyaki, Ali, et al.
Published: (2025)
by: Mazyaki, Ali, et al.
Published: (2025)
Guidelines for the Annotation and Visualization of Legal Argumentation Structures in Chinese Judicial Decisions
by: Chen, Kun, et al.
Published: (2026)
by: Chen, Kun, et al.
Published: (2026)
AMemGym: Interactive Memory Benchmarking for Assistants in Long-Horizon Conversations
by: Jiayang, Cheng, et al.
Published: (2026)
by: Jiayang, Cheng, et al.
Published: (2026)
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
by: Wang, Tianle, et al.
Published: (2026)
by: Wang, Tianle, et al.
Published: (2026)
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints
by: Zhang, Yinger, et al.
Published: (2026)
by: Zhang, Yinger, et al.
Published: (2026)
GRAVITY: Architecture-Agnostic Structured Anchoring for Long-Horizon Conversational Memory
by: Sun, Yushi, et al.
Published: (2026)
by: Sun, Yushi, et al.
Published: (2026)
`Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory
by: Cardenas, Ronald, et al.
Published: (2024)
by: Cardenas, Ronald, et al.
Published: (2024)
Labeling Case Similarity based on Co-Citation of Legal Articles in Judgment Documents with Empirical Dispute-Based Evaluation
by: Liu, Chao-Lin, et al.
Published: (2025)
by: Liu, Chao-Lin, et al.
Published: (2025)
Learning and Enforcing Context-Sensitive Control for LLMs
by: Albinhassan, Mohammad, et al.
Published: (2026)
by: Albinhassan, Mohammad, et al.
Published: (2026)
Similar Items
-
Intrinsic Stability Limits of Autoregressive Reasoning: Structural Consequences for Long-Horizon Execution
by: Liao, Hsien-Jyh
Published: (2026) -
GPTs and Language Barrier: A Cross-Lingual Legal QA Examination
by: Nguyen, Ha-Thanh, et al.
Published: (2024) -
Causal Intervention-Based Memory Selection for Long-Horizon LLM Agents
by: Srivastava, Saksham Sahai
Published: (2026) -
Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model
by: Lin, Chun-Hsien, et al.
Published: (2024) -
Dynamic Long Short-Term Memory Based Memory Storage For Long Horizon LLM Interaction
by: Lou, Yuyang, et al.
Published: (2025)