| Main Authors: | Chen, Xudong, Liu, Yixin, Wei, Hua, Ding, Kaize |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.14483 |
Similar Items
Residual Reinforcement Learning for Robot Teleoperation under Stochastic Delays
by: Deng, Kaize, et al.
Published: (2026)
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
by: Chen, Yiqun, et al.
Published: (2022)
GNN-as-Judge: Unleashing the Power of LLMs for Graph Learning with GNN Feedback
by: Xu, Ruiyao, et al.
Published: (2026)
MolMem: Memory-Augmented Agentic Reinforcement Learning for Sample-Efficient Molecular Optimization
by: Wang, Ziqing, et al.
Published: (2026)
Reinforcement Learning for Autonomous Warehouse Orchestration in SAP Logistics Execution: Redefining Supply Chain Agility
by: Pillella, Sumanth
Published: (2025)
POLO: Preference-Guided Multi-Turn Reinforcement Learning for Lead Optimization
by: Wang, Ziqing, et al.
Published: (2025)
Explainable and Fine-Grained Safeguarding of LLM Multi-Agent Systems via Bi-Level Graph Anomaly Detection
by: Pan, Junjun, et al.
Published: (2025)
Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration
by: Guo, Liangxuan, et al.
Published: (2025)
EvoMAS: Learning Execution-Time Workflows for Multi-Agent Systems
by: Xu, Chengdong, et al.
Published: (2026)
Learning Spatiotemporal Sensitivity in Video LLMs via Counterfactual Reinforcement Learning
by: Du, Dazhao, et al.
Published: (2026)
Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction
by: Zhe, Tao, et al.
Published: (2026)
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
by: Feng, Lang, et al.
Published: (2025)
Multi-Agent Collaboration via Evolving Orchestration
by: Dang, Yufan, et al.
Published: (2025)
Small Model as Master Orchestrator: Learning Unified Agent-Tool Orchestration with Parallel Subtask Decomposition
by: Yuan, Wenzhen, et al.
Published: (2026)
Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution
by: Zhang, Xing, et al.
Published: (2026)
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
by: Liu, Zongkai, et al.
Published: (2024)
Beyond Sharp Minima: Robust LLM Unlearning via Feedback-Guided Multi-Point Optimization
by: Wu, Wenhan, et al.
Published: (2025)
Multi-Agent Collaboration via Cross-Team Orchestration
by: Du, Zhuoyun, et al.
Published: (2024)
Concept Learning for Cooperative Multi-Agent Reinforcement Learning
by: Ge, Zhonghan, et al.
Published: (2025)
Towards Scalable Lightweight GUI Agents via Multi-role Orchestration
by: Wang, Ziwei, et al.
Published: (2026)
AgentInit: Initializing LLM-based Multi-Agent Systems via Diversity and Expertise Orchestration for Effective and Efficient Collaboration
by: Tian, Chunhao, et al.
Published: (2025)
Hierarchical Memory Orchestration for Personalized Persistent Agents
by: Liu, Junming, et al.
Published: (2026)
Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning
by: Chen, Yiqun, et al.
Published: (2026)
Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration
by: Guo, Xudong, et al.
Published: (2024)
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization
by: Pang, Jing-Cheng, et al.
Published: (2021)
LLM Collaboration With Multi-Agent Reinforcement Learning
by: Liu, Shuo, et al.
Published: (2025)
Explaining Reinforcement Learning: A Counterfactual Shapley Values Approach
by: Shi, Yiwei, et al.
Published: (2024)
Multi-Agent Coordination Adaptation via Structure-Guided Orchestration
by: Li, Haoran, et al.
Published: (2026)
Counterfactual Credit Policy Optimization for Multi-Agent Collaboration
by: Li, Zhongyi, et al.
Published: (2026)
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
by: Li, Binxu, et al.
Published: (2024)
Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems
by: Shi, Xi, et al.
Published: (2026)
Heterogeneity in Multi-Agent Reinforcement Learning
by: Hu, Tianyi, et al.
Published: (2025)
OPERA: A Reinforcement Learning-Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval
by: Liu, Yu, et al.
Published: (2025)
xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning
by: Qian, Cheng, et al.
Published: (2025)
Quantum-Train-Based Distributed Multi-Agent Reinforcement Learning
by: Chen, Kuan-Cheng, et al.
Published: (2024)
Robust and Efficient Communication in Multi-Agent Reinforcement Learning
by: Liu, Zejiao, et al.
Published: (2025)
Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning
by: Chen, Jianming, et al.
Published: (2024)
Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning
by: Liao, Wei-Chen, et al.
Published: (2025)
Energy-Aware Multi-Agent Reinforcement Learning for Collaborative Execution in Mission-Oriented Drone Networks
by: Li, Ying, et al.
Published: (2024)
Counterfactual Explanations for Continuous Action Reinforcement Learning
by: Dong, Shuyang, et al.
Published: (2025)