| Main Authors: | Kirchdorfer, Lukas, Doumeni, Artemis, van der Aa, Han, López, Hugo A. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.01857 |
Similar Items
AgentSimulator: An Agent-based Approach for Data-driven Business Process Simulation
by: Kirchdorfer, Lukas, et al.
Published: (2024)
Generalized Per-Agent Advantage Estimation for Multi-Agent Policy Optimization
by: Kim, Seongmin, et al.
Published: (2026)
Multi-Agent Guided Policy Optimization
by: Li, Yueheng, et al.
Published: (2025)
Policy Optimization in Multi-Agent Settings under Partially Observable Environments
by: Zhaikhan, Ainur, et al.
Published: (2025)
MARPO: A Reflective Policy Optimization for Multi Agent Reinforcement Learning
by: Wu, Cuiling, et al.
Published: (2025)
Distributed Neural Policy Gradient Algorithm for Global Convergence of Networked Multi-Agent Reinforcement Learning
by: Dai, Pengcheng, et al.
Published: (2025)
Safe Equilibrium Policy Optimization for Strategic Agent Policies
by: Arumugam, Karthika, et al.
Published: (2026)
Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
by: Zhang, Qixin, et al.
Published: (2025)
Unification of Consensus-Based Multi-Objective Optimization and Multi-Robot Path Planning
by: Wozniak, Michael P.
Published: (2025)
B2MAPO: A Batch-by-Batch Multi-Agent Policy Optimization to Balance Performance and Efficiency
by: Zhang, Wenjing, et al.
Published: (2024)
MACH: Multi-Agent Coordination for RSU-centric Handovers
by: Spring, Nikolaus, et al.
Published: (2025)
DTPPO: Dual-Transformer Encoder-based Proximal Policy Optimization for Multi-UAV Navigation in Unseen Complex Environments
by: Wei, Anning, et al.
Published: (2024)
From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination
by: Yao, Chang, et al.
Published: (2025)
Heterogeneous Value Decomposition Policy Fusion for Multi-Agent Cooperation
by: Wang, Siying, et al.
Published: (2025)
Enwar 3.0: An Agentic Multi-Modal LLM Orchestrator for Situation-Aware Beamforming, Blockage Prediction, and Handover Management
by: Nazar, Ahmad M., et al.
Published: (2026)
Symmetric Policy Design for Multi-Agent Dispatch Coordination in Supply Chains
by: Sudhakara, Sagar
Published: (2025)
Synthesis of Communication Policies for Multi-Agent Systems Robust to Communication Restrictions
by: Soudijani, Saleh, et al.
Published: (2025)
CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration
by: Fu, Yuqian, et al.
Published: (2024)
Counterfactual Multi-Agent Policy Gradients
by: Foerster, Jakob, et al.
Published: (2017)
Optimistic Multi-Agent Policy Gradient
by: Zhao, Wenshuai, et al.
Published: (2023)
PolicySimEval: A Benchmark for Evaluating Policy Outcomes through Agent-Based Simulation
by: Kang, Jiaju, et al.
Published: (2025)
Co-Optimizing Reconfigurable Environments and Policies for Decentralized Multi-Agent Navigation
by: Gao, Zhan, et al.
Published: (2024)
Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMs
by: Yang, Wei, et al.
Published: (2025)
MAPPO-LCR: Multi-Agent Proximal Policy Optimization with Local Cooperation Reward in Spatial Public Goods Games
by: Yang, Zhaoqilin, et al.
Published: (2025)
A Model for Multi-Agent Autonomy That Uses Opinion Dynamics and Multi-Objective Behavior Optimization
by: Paine, Tyler M., et al.
Published: (2023)
MALBO: Optimizing LLM-Based Multi-Agent Teams via Multi-Objective Bayesian Optimization
by: Sabbatella, Antonio
Published: (2025)
Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization
by: Fan, Yijia, et al.
Published: (2025)
DHLight: Multi-agent Policy-based Directed Hypergraph Learning for Traffic Signal Control
by: Lei, Zhen, et al.
Published: (2024)
Multi-Agent Systems Should be Treated as Principal-Agent Problems
by: Rauba, Paulius, et al.
Published: (2026)
Improving Learnt Local MAPF Policies with Heuristic Search
by: Veerapaneni, Rishi, et al.
Published: (2024)
Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning
by: Zhang, Beining, et al.
Published: (2025)
The Value of Variance: Mitigating Debate Collapse in Multi-Agent Systems via Uncertainty-Driven Policy Optimization
by: Tang, Luoxi, et al.
Published: (2026)
Network-Constrained Policy Optimization for Adaptive Multi-agent Vehicle Routing
by: Arasteh, Fazel, et al.
Published: (2025)
Learning Policies for Dynamic Coalition Formation in Multi-Robot Task Allocation
by: Bezerra, Lucas C. D., et al.
Published: (2024)
Generalization of Heterogeneous Multi-Robot Policies via Awareness and Communication of Capabilities
by: Howell, Pierce, et al.
Published: (2024)
A Historical Interaction-Enhanced Shapley Policy Gradient Algorithm for Multi-Agent Credit Assignment
by: Ding, Ao, et al.
Published: (2025)
Measuring Policy Distance for Multi-Agent Reinforcement Learning
by: Hu, Tianyi, et al.
Published: (2024)
AgentMixer: Multi-Agent Correlated Policy Factorization
by: Li, Zhiyuan, et al.
Published: (2024)
Conformal Off-Policy Prediction for Multi-Agent Systems
by: Kuipers, Tom, et al.
Published: (2024)
Fairness Aware Reinforcement Learning via Proximal Policy Optimization
by: La Malfa, Gabriele, et al.
Published: (2025)