:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Xudong, Liu, Yixin, Wei, Hua, Ding, Kaize
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.14483
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Residual Reinforcement Learning for Robot Teleoperation under Stochastic Delays
by: Deng, Kaize, et al.
Published: (2026)

PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
by: Chen, Yiqun, et al.
Published: (2022)

GNN-as-Judge: Unleashing the Power of LLMs for Graph Learning with GNN Feedback
by: Xu, Ruiyao, et al.
Published: (2026)

MolMem: Memory-Augmented Agentic Reinforcement Learning for Sample-Efficient Molecular Optimization
by: Wang, Ziqing, et al.
Published: (2026)

Reinforcement Learning for Autonomous Warehouse Orchestration in SAP Logistics Execution: Redefining Supply Chain Agility
by: Pillella, Sumanth
Published: (2025)

POLO: Preference-Guided Multi-Turn Reinforcement Learning for Lead Optimization
by: Wang, Ziqing, et al.
Published: (2025)

Explainable and Fine-Grained Safeguarding of LLM Multi-Agent Systems via Bi-Level Graph Anomaly Detection
by: Pan, Junjun, et al.
Published: (2025)

Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration
by: Guo, Liangxuan, et al.
Published: (2025)

EvoMAS: Learning Execution-Time Workflows for Multi-Agent Systems
by: Xu, Chengdong, et al.
Published: (2026)

Learning Spatiotemporal Sensitivity in Video LLMs via Counterfactual Reinforcement Learning
by: Du, Dazhao, et al.
Published: (2026)

Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction
by: Zhe, Tao, et al.
Published: (2026)

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
by: Feng, Lang, et al.
Published: (2025)

Multi-Agent Collaboration via Evolving Orchestration
by: Dang, Yufan, et al.
Published: (2025)

Small Model as Master Orchestrator: Learning Unified Agent-Tool Orchestration with Parallel Subtask Decomposition
by: Yuan, Wenzhen, et al.
Published: (2026)

Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution
by: Zhang, Xing, et al.
Published: (2026)

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
by: Liu, Zongkai, et al.
Published: (2024)

Beyond Sharp Minima: Robust LLM Unlearning via Feedback-Guided Multi-Point Optimization
by: Wu, Wenhan, et al.
Published: (2025)

Multi-Agent Collaboration via Cross-Team Orchestration
by: Du, Zhuoyun, et al.
Published: (2024)

Concept Learning for Cooperative Multi-Agent Reinforcement Learning
by: Ge, Zhonghan, et al.
Published: (2025)

Towards Scalable Lightweight GUI Agents via Multi-role Orchestration
by: Wang, Ziwei, et al.
Published: (2026)

AgentInit: Initializing LLM-based Multi-Agent Systems via Diversity and Expertise Orchestration for Effective and Efficient Collaboration
by: Tian, Chunhao, et al.
Published: (2025)

Hierarchical Memory Orchestration for Personalized Persistent Agents
by: Liu, Junming, et al.
Published: (2026)

Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning
by: Chen, Yiqun, et al.
Published: (2026)

Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration
by: Guo, Xudong, et al.
Published: (2024)

Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization
by: Pang, Jing-Cheng, et al.
Published: (2021)

LLM Collaboration With Multi-Agent Reinforcement Learning
by: Liu, Shuo, et al.
Published: (2025)

Explaining Reinforcement Learning: A Counterfactual Shapley Values Approach
by: Shi, Yiwei, et al.
Published: (2024)

Multi-Agent Coordination Adaptation via Structure-Guided Orchestration
by: Li, Haoran, et al.
Published: (2026)

Counterfactual Credit Policy Optimization for Multi-Agent Collaboration
by: Li, Zhongyi, et al.
Published: (2026)

MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
by: Li, Binxu, et al.
Published: (2024)

Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems
by: Shi, Xi, et al.
Published: (2026)

Heterogeneity in Multi-Agent Reinforcement Learning
by: Hu, Tianyi, et al.
Published: (2025)

OPERA: A Reinforcement Learning--Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval
by: Liu, Yu, et al.
Published: (2025)

xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning
by: Qian, Cheng, et al.
Published: (2025)

Quantum-Train-Based Distributed Multi-Agent Reinforcement Learning
by: Chen, Kuan-Cheng, et al.
Published: (2024)

Robust and Efficient Communication in Multi-Agent Reinforcement Learning
by: Liu, Zejiao, et al.
Published: (2025)

Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning
by: Chen, Jianming, et al.
Published: (2024)

Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning
by: Liao, Wei-Chen, et al.
Published: (2025)

Energy-Aware Multi-Agent Reinforcement Learning for Collaborative Execution in Mission-Oriented Drone Networks
by: Li, Ying, et al.
Published: (2024)

Counterfactual Explanations for Continuous Action Reinforcement Learning
by: Dong, Shuyang, et al.
Published: (2025)