Guardado en:
| Autores principales: | Liu, Zejiao, Li, Yi, Wang, Jiali, Tu, Junqi, Hong, Yitian, Li, Fangfei, Liu, Yang, Sugawara, Toshiharu, Tang, Yang |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2511.11393 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
HCPO: Hierarchical Conductor-Based Policy Optimization in Multi-Agent Reinforcement Learning
por: Liu, Zejiao, et al.
Publicado: (2025)
por: Liu, Zejiao, et al.
Publicado: (2025)
Reducing Redundant Computation in Multi-Agent Coordination through Locally Centralized Execution
por: Bai, Yidong, et al.
Publicado: (2024)
por: Bai, Yidong, et al.
Publicado: (2024)
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
por: Li, Simin, et al.
Publicado: (2023)
por: Li, Simin, et al.
Publicado: (2023)
Evolutionary Enhanced Multi-Agent Reinforcement Learning for Cooperative Air Combat
por: Li, Chengwei, et al.
Publicado: (2026)
por: Li, Chengwei, et al.
Publicado: (2026)
Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
por: Yu, Song, et al.
Publicado: (2024)
por: Yu, Song, et al.
Publicado: (2024)
Adaptive Robust Estimator for Multi-Agent Reinforcement Learning
por: Li, Zhongyi, et al.
Publicado: (2026)
por: Li, Zhongyi, et al.
Publicado: (2026)
Multi-Agent Reinforcement Learning with Communication-Constrained Priors
por: Yang, Guang, et al.
Publicado: (2025)
por: Yang, Guang, et al.
Publicado: (2025)
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
por: Li, Simin, et al.
Publicado: (2025)
por: Li, Simin, et al.
Publicado: (2025)
Reducing Cognitive Overhead in Tool Use via Multi-Small-Agent Reinforcement Learning
por: Wang, Dayu, et al.
Publicado: (2025)
por: Wang, Dayu, et al.
Publicado: (2025)
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
por: Liu, Shulin, et al.
Publicado: (2025)
por: Liu, Shulin, et al.
Publicado: (2025)
Interference-Aware K-Step Reachable Communication in Multi-Agent Reinforcement Learning
por: Cheng, Ziyu, et al.
Publicado: (2026)
por: Cheng, Ziyu, et al.
Publicado: (2026)
Multi-Scenario Combination Based on Multi-Agent Reinforcement Learning to Optimize the Advertising Recommendation System
por: Zhao, Yang, et al.
Publicado: (2024)
por: Zhao, Yang, et al.
Publicado: (2024)
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
por: Zhang, Zijing, et al.
Publicado: (2025)
por: Zhang, Zijing, et al.
Publicado: (2025)
Multi-Agent Collaborative Reward Design for Enhancing Reasoning in Reinforcement Learning
por: Yang, Pei, et al.
Publicado: (2025)
por: Yang, Pei, et al.
Publicado: (2025)
Multi-Agent Reinforcement Learning: Methods, Applications, Visionary Prospects, and Challenges
por: Zhou, Ziyuan, et al.
Publicado: (2023)
por: Zhou, Ziyuan, et al.
Publicado: (2023)
Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning
por: Li, Simin, et al.
Publicado: (2025)
por: Li, Simin, et al.
Publicado: (2025)
Partially Observable Mean Field Multi-Agent Reinforcement Learning Based on Graph-Attention
por: Yang, Min, et al.
Publicado: (2023)
por: Yang, Min, et al.
Publicado: (2023)
A Robust and Efficient Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
por: Huang, Sheng-You, et al.
Publicado: (2026)
por: Huang, Sheng-You, et al.
Publicado: (2026)
Property-driven Protein Inverse Folding With Multi-Objective Preference Alignment
por: Hou, Xiaoyang, et al.
Publicado: (2026)
por: Hou, Xiaoyang, et al.
Publicado: (2026)
Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding
por: Zhou, Yuhang, et al.
Publicado: (2025)
por: Zhou, Yuhang, et al.
Publicado: (2025)
Rethinking the Design of Reinforcement Learning-Based Deep Research Agents
por: Wan, Yi, et al.
Publicado: (2025)
por: Wan, Yi, et al.
Publicado: (2025)
Escaping the Context Bottleneck: Active Context Curation for LLM Agents via Reinforcement Learning
por: Li, Xiaozhe, et al.
Publicado: (2026)
por: Li, Xiaozhe, et al.
Publicado: (2026)
Enhancing Cloud Network Resilience via a Robust LLM-Empowered Multi-Agent Reinforcement Learning Framework
por: Peng, Yixiao, et al.
Publicado: (2026)
por: Peng, Yixiao, et al.
Publicado: (2026)
ArchPilot: A Proxy-Guided Multi-Agent Approach for Machine Learning Engineering
por: Yuan, Zhuowen, et al.
Publicado: (2025)
por: Yuan, Zhuowen, et al.
Publicado: (2025)
Causal Mean Field Multi-Agent Reinforcement Learning
por: Ma, Hao, et al.
Publicado: (2025)
por: Ma, Hao, et al.
Publicado: (2025)
Cross-Modal Navigation with Multi-Agent Reinforcement Learning
por: Liu, Shuo, et al.
Publicado: (2026)
por: Liu, Shuo, et al.
Publicado: (2026)
Tool-RoCo: An Agent-as-Tool Self-organization Large Language Model Benchmark in Multi-robot Cooperation
por: Zhang, Ke, et al.
Publicado: (2025)
por: Zhang, Ke, et al.
Publicado: (2025)
Debate over Mixed-knowledge: A Robust Multi-Agent Reasoning Framework for Incomplete Knowledge Graph Question Answering
por: Liu, Jilong, et al.
Publicado: (2025)
por: Liu, Jilong, et al.
Publicado: (2025)
MGCBS: An Optimal and Efficient Algorithm for Solving Multi-Goal Multi-Agent Path Finding Problem
por: Tang, Mingkai, et al.
Publicado: (2024)
por: Tang, Mingkai, et al.
Publicado: (2024)
DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs
por: Li, Yuanhao, et al.
Publicado: (2025)
por: Li, Yuanhao, et al.
Publicado: (2025)
Multi-Agent Deep Reinforcement Learning Under Constrained Communications
por: Shaik, Shahil, et al.
Publicado: (2026)
por: Shaik, Shahil, et al.
Publicado: (2026)
Efficient Multi-agent Reinforcement Learning by Planning
por: Liu, Qihan, et al.
Publicado: (2024)
por: Liu, Qihan, et al.
Publicado: (2024)
ProActor: Timing-Aware Reinforcement Learning for Proactive Task Scheduling Agents
por: Ding, Lei, et al.
Publicado: (2026)
por: Ding, Lei, et al.
Publicado: (2026)
Evo-MARL: Co-Evolutionary Multi-Agent Reinforcement Learning for Internalized Safety
por: Pan, Zhenyu, et al.
Publicado: (2025)
por: Pan, Zhenyu, et al.
Publicado: (2025)
Efficient and Stable Reinforcement Learning for Diffusion Language Models
por: Liu, Jiawei, et al.
Publicado: (2026)
por: Liu, Jiawei, et al.
Publicado: (2026)
Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization
por: Fan, Yijia, et al.
Publicado: (2025)
por: Fan, Yijia, et al.
Publicado: (2025)
Efficient Training in Multi-Agent Reinforcement Learning: A Communication-Free Framework for the Box-Pushing Problem
por: Ge, David, et al.
Publicado: (2024)
por: Ge, David, et al.
Publicado: (2024)
ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning
por: Liu, Zeyuan, et al.
Publicado: (2025)
por: Liu, Zeyuan, et al.
Publicado: (2025)
DeepRefine: Agent-Compiled Knowledge Refinement via Reinforcement Learning
por: Huang, Haoyu, et al.
Publicado: (2026)
por: Huang, Haoyu, et al.
Publicado: (2026)
MaskMA: Towards Zero-Shot Multi-Agent Decision Making with Mask-Based Collaborative Learning
por: Liu, Jie, et al.
Publicado: (2023)
por: Liu, Jie, et al.
Publicado: (2023)
Ejemplares similares
-
HCPO: Hierarchical Conductor-Based Policy Optimization in Multi-Agent Reinforcement Learning
por: Liu, Zejiao, et al.
Publicado: (2025) -
Reducing Redundant Computation in Multi-Agent Coordination through Locally Centralized Execution
por: Bai, Yidong, et al.
Publicado: (2024) -
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
por: Li, Simin, et al.
Publicado: (2023) -
Evolutionary Enhanced Multi-Agent Reinforcement Learning for Cooperative Air Combat
por: Li, Chengwei, et al.
Publicado: (2026) -
Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
por: Yu, Song, et al.
Publicado: (2024)