Saved in:
| Main Authors: | Jiang, Tanqiu, Wang, Yuhui, Liang, Jiacheng, Wang, Ting |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.16901 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MAGE: Safeguarding LLM Agents against Long-Horizon Threats via Shadow Memory
by: Wang, Yuhui, et al.
Published: (2026)
by: Wang, Yuhui, et al.
Published: (2026)
RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
by: Jiang, Tanqiu, et al.
Published: (2024)
by: Jiang, Tanqiu, et al.
Published: (2024)
RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models
by: Liang, Jiacheng, et al.
Published: (2026)
by: Liang, Jiacheng, et al.
Published: (2026)
GraphRAG under Fire
by: Liang, Jiacheng, et al.
Published: (2025)
by: Liang, Jiacheng, et al.
Published: (2025)
Your Agent Can Defend Itself against Backdoor Attacks
by: Changjiang, Li, et al.
Published: (2025)
by: Changjiang, Li, et al.
Published: (2025)
UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios
by: Luo, Haotian, et al.
Published: (2025)
by: Luo, Haotian, et al.
Published: (2025)
HiMem: Hierarchical Long-Term Memory for LLM Long-Horizon Agents
by: Zhang, Ningning, et al.
Published: (2026)
by: Zhang, Ningning, et al.
Published: (2026)
Can LLM Agents Be CFOs? Benchmarking Long-Horizon Resource Allocation in an Uncertain Enterprise Environment
by: Han, Yi, et al.
Published: (2026)
by: Han, Yi, et al.
Published: (2026)
AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management
by: Tian, Shizuo, et al.
Published: (2025)
by: Tian, Shizuo, et al.
Published: (2025)
Cross-Modal Content Optimization for Steering Web Agent Preferences
by: Jiang, Tanqiu, et al.
Published: (2025)
by: Jiang, Tanqiu, et al.
Published: (2025)
Can LLM Agents Sustain Long-Horizon Organizational Dynamics?
by: Zhu, Xuancheng, et al.
Published: (2026)
by: Zhu, Xuancheng, et al.
Published: (2026)
Parallel Context Compaction for Long-Horizon LLM Agent Serving
by: Cim, Musa, et al.
Published: (2026)
by: Cim, Musa, et al.
Published: (2026)
Hindsight Credit Assignment for Long-Horizon LLM Agents
by: Tan, Hui-Ze, et al.
Published: (2026)
by: Tan, Hui-Ze, et al.
Published: (2026)
Continuum Memory Architectures for Long-Horizon LLM Agents
by: Logan, Joe
Published: (2026)
by: Logan, Joe
Published: (2026)
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
by: Zhang, Hanrong, et al.
Published: (2024)
by: Zhang, Hanrong, et al.
Published: (2024)
Reinforcement Learning for Long-Horizon Interactive LLM Agents
by: Chen, Kevin, et al.
Published: (2025)
by: Chen, Kevin, et al.
Published: (2025)
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents
by: Wang, Taiyi, et al.
Published: (2026)
by: Wang, Taiyi, et al.
Published: (2026)
CheatAgent: Attacking LLM-Empowered Recommender Systems via LLM Agent
by: Ning, Liang-bo, et al.
Published: (2025)
by: Ning, Liang-bo, et al.
Published: (2025)
Self-Improving Model Steering
by: Zhu, Rongyi, et al.
Published: (2025)
by: Zhu, Rongyi, et al.
Published: (2025)
Reasoning or Retrieval? A Study of Answer Attribution on Large Reasoning Models
by: Wang, Yuhui, et al.
Published: (2025)
by: Wang, Yuhui, et al.
Published: (2025)
LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents
by: Lu, Yijun, et al.
Published: (2026)
by: Lu, Yijun, et al.
Published: (2026)
ColorBench: Benchmarking Mobile Agents with Graph-Structured Framework for Complex Long-Horizon Tasks
by: Song, Yuanyi, et al.
Published: (2025)
by: Song, Yuanyi, et al.
Published: (2025)
TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios
by: Shen, Yuanzhe, et al.
Published: (2026)
by: Shen, Yuanzhe, et al.
Published: (2026)
LongDA: Benchmarking LLM Agents for Long-Document Data Analysis
by: Li, Yiyang, et al.
Published: (2026)
by: Li, Yiyang, et al.
Published: (2026)
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
by: Li, Junlong, et al.
Published: (2025)
by: Li, Junlong, et al.
Published: (2025)
AgentFold: Long-Horizon Web Agents with Proactive Context Management
by: Ye, Rui, et al.
Published: (2025)
by: Ye, Rui, et al.
Published: (2025)
CoMIC: Collaborative Memory and Insights Circulation for Long-Horizon LLM Agents in Cloud-Edge Systems
by: Wang, Yannan, et al.
Published: (2026)
by: Wang, Yannan, et al.
Published: (2026)
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
by: Wu, Xiyang, et al.
Published: (2026)
by: Wu, Xiyang, et al.
Published: (2026)
Causal Intervention-Based Memory Selection for Long-Horizon LLM Agents
by: Srivastava, Saksham Sahai
Published: (2026)
by: Srivastava, Saksham Sahai
Published: (2026)
The Cognitive Bandwidth Bottleneck: Shifting Long-Horizon Agent from Planning with Actions to Planning with Schemas
by: Xu, Baixuan, et al.
Published: (2025)
by: Xu, Baixuan, et al.
Published: (2025)
TRACE: Trajectory Risk-Aware Compression for Long-Horizon Agent Safety
by: Hong, Zhepei, et al.
Published: (2026)
by: Hong, Zhepei, et al.
Published: (2026)
AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning
by: Hu, Yuyang, et al.
Published: (2026)
by: Hu, Yuyang, et al.
Published: (2026)
SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent
by: Hu, Yuyang, et al.
Published: (2026)
by: Hu, Yuyang, et al.
Published: (2026)
Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents
by: Gao, Heyang, et al.
Published: (2025)
by: Gao, Heyang, et al.
Published: (2025)
When the Specification Emerges: Benchmarking Faithfulness Loss in Long-Horizon Coding Agents
by: Yan, Lu, et al.
Published: (2026)
by: Yan, Lu, et al.
Published: (2026)
AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents
by: Chen, Haotian, et al.
Published: (2026)
by: Chen, Haotian, et al.
Published: (2026)
Aeon: High-Performance Neuro-Symbolic Memory Management for Long-Horizon LLM Agents
by: Arslan, Mustafa
Published: (2026)
by: Arslan, Mustafa
Published: (2026)
Beyond pass@1: A Reliability Science Framework for Long-Horizon LLM Agents
by: Khanal, Aaditya, et al.
Published: (2026)
by: Khanal, Aaditya, et al.
Published: (2026)
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
by: Qiu, Jielin, et al.
Published: (2025)
by: Qiu, Jielin, et al.
Published: (2025)
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents
by: Feng, Zhaopeng, et al.
Published: (2026)
by: Feng, Zhaopeng, et al.
Published: (2026)
Similar Items
-
MAGE: Safeguarding LLM Agents against Long-Horizon Threats via Shadow Memory
by: Wang, Yuhui, et al.
Published: (2026) -
RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
by: Jiang, Tanqiu, et al.
Published: (2024) -
RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models
by: Liang, Jiacheng, et al.
Published: (2026) -
GraphRAG under Fire
by: Liang, Jiacheng, et al.
Published: (2025) -
Your Agent Can Defend Itself against Backdoor Attacks
by: Changjiang, Li, et al.
Published: (2025)