| Main Authors: | Fan, Shengda, Ye, Xuyan, Huo, Yupeng, Chen, Zhi-Yuan, Guo, Yiju, Yang, Shenzhi, Yang, Wenkai, Ye, Shuqi, Chen, Jingwen, Chen, Haotian, Cong, Xin, Lin, Yankai |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.14465 |
Similar Items
DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution
by: Fan, Shengda, et al.
Published: (2026)
DeepCritic: Deliberate Critique with Large Language Models
by: Yang, Wenkai, et al.
Published: (2025)
AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation
by: Huo, Yupeng, et al.
Published: (2026)
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents
by: Yang, Wenkai, et al.
Published: (2024)
Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning
by: Guo, Yiju, et al.
Published: (2025)
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
by: Yang, Wenkai, et al.
Published: (2025)
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
by: Zhang, Zhong, et al.
Published: (2025)
La Resurrección del Público en la Autoría de los Procesos Creativos en los Espacios Procomunes
by: Chen Yiju
Published: (2013)
ProBench: Benchmarking GUI Agents with Accurate Process Information
by: Yang, Leyang, et al.
Published: (2025)
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
by: Lu, Yaxi, et al.
Published: (2024)
Multi-Agent Reinforcement Learning with Communication-Constrained Priors
by: Yang, Guang, et al.
Published: (2025)
ProcCtrlBench: Evaluating Process-Level Defects and Control Preservation in LLM Coding Agents
by: He, Jiawei, et al.
Published: (2026)
Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification
by: Guo, Yiju, et al.
Published: (2026)
StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error
by: Yang, Shu-Xun, et al.
Published: (2025)
Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice
by: Jiang, Cong, et al.
Published: (2024)
AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents
by: Chen, Haotian, et al.
Published: (2026)
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use
by: Lu, Jiaxuan, et al.
Published: (2026)
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress
by: Xi, Zhiheng, et al.
Published: (2025)
Temporal Dynamics Decoupling with Inverse Processing for Enhancing Human Motion Prediction
by: Wang, Jiexin, et al.
Published: (2024)
Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
by: Xiong, Weimin, et al.
Published: (2024)
Exploring Backdoor Vulnerabilities of Chat Models
by: Hao, Yunzhuo, et al.
Published: (2024)
AgentEscapeBench: Evaluating Out-of-Domain Tool-Grounded Reasoning in LLM Agents
by: Guo, Zhengkang, et al.
Published: (2026)
COMPASS: Cognitive MCTS-Guided Process Alignment for Safe Search Agents
by: Shen, Wenkai, et al.
Published: (2026)
SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization
by: Wang, Zhengcheng, et al.
Published: (2025)
Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents
by: Ye, Ye
Published: (2025)
Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents
by: Yang, Ruihan, et al.
Published: (2026)
AgentGuard: Repurposing Agentic Orchestrator for Safety Evaluation of Tool Orchestration
by: Chen, Jizhou, et al.
Published: (2025)
Rational Decision-Making Agent with Internalized Utility Judgment
by: Ye, Yining, et al.
Published: (2023)
DebugBench: Evaluating Debugging Capability of Large Language Models
by: Tian, Runchu, et al.
Published: (2024)
Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction
by: Bao, Han, et al.
Published: (2026)
Task Memory Engine (TME): Enhancing State Awareness for Multi-Step LLM Agent Tasks
by: Ye, Ye
Published: (2025)
GUICourse: From General Vision Language Models to Versatile GUI Agents
by: Chen, Wentong, et al.
Published: (2024)
Co-ReAct: Rubrics as Step-Level Collaborators for ReAct Agents
by: Kang, Jiazheng, et al.
Published: (2026)
AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators
by: Mazumder, Aritra, et al.
Published: (2026)
Think-on-Process: Dynamic Process Generation for Collaborative Development of Multi-Agent System
by: Lin, Leilei, et al.
Published: (2024)
InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents
by: Du, Yaxin, et al.
Published: (2025)
STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft
by: Zhao, Zhonghan, et al.
Published: (2024)
GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis
by: Yu, Bo, et al.
Published: (2026)
Behind EvoMap: Characterizing a Self-Evolving Agent-to-Agent Collaboration Network
by: Ye, Qiming, et al.
Published: (2026)
DirectEdit: Step-Level Accurate Inversion for Flow-Based Image Editing
by: Yang, Desong, et al.
Published: (2026)