Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Yifeng, Chen, Jieming, Zhou, Tingguang, Duhan, Tanishq, Dong, Jianghong, Cao, Yuhong, Sartoretti, Guillaume
Format:	Preprint
Published:	2026
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2603.24931
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912982942875648
author	Zhang, Yifeng Chen, Jieming Zhou, Tingguang Duhan, Tanishq Dong, Jianghong Cao, Yuhong Sartoretti, Guillaume
author_facet	Zhang, Yifeng Chen, Jieming Zhou, Tingguang Duhan, Tanishq Dong, Jianghong Cao, Yuhong Sartoretti, Guillaume
contents	Multi-Agent Self-Driving (MASD) systems provide an effective solution for coordinating autonomous vehicles to reduce congestion and enhance both safety and operational efficiency in future intelligent transportation systems. Multi-Agent Reinforcement Learning (MARL) has emerged as a promising approach for developing advanced end-to-end MASD systems. However, achieving efficient and safe collaboration in dynamic MASD systems remains a significant challenge in dense scenarios with complex agent interactions. To address this challenge, we propose a novel collaborative(CO-) interaction-aware(-IN) MARL framework, named COIN. Specifically, we develop a new counterfactual individual-global twin delayed deep deterministic policy gradient (CIG-TD3) algorithm, crafted in a "centralized training, decentralized execution" (CTDE) manner, which aims to jointly optimize the individual objectives (navigation) and the global objectives (collaboration) of agents. We further introduce a dual-level interaction-aware centralized critic architecture that captures both local pairwise interactions and global system-level dependencies, enabling more accurate global value estimation and improved credit assignment for collaborative policy learning. We conduct extensive simulation experiments in dense urban traffic environments, which demonstrate that COIN consistently outperforms other advanced baseline methods in both safety and efficiency across various system sizes. These results highlight its superiority in complex and dynamic MASD scenarios, as further validated through real-world robot demonstrations. Supplementary videos are available at https://marmotlab.github.io/COIN/
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_24931
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	COIN: Collaborative Interaction-Aware Multi-Agent Reinforcement Learning for Self-Driving Systems Zhang, Yifeng Chen, Jieming Zhou, Tingguang Duhan, Tanishq Dong, Jianghong Cao, Yuhong Sartoretti, Guillaume Robotics Multi-Agent Self-Driving (MASD) systems provide an effective solution for coordinating autonomous vehicles to reduce congestion and enhance both safety and operational efficiency in future intelligent transportation systems. Multi-Agent Reinforcement Learning (MARL) has emerged as a promising approach for developing advanced end-to-end MASD systems. However, achieving efficient and safe collaboration in dynamic MASD systems remains a significant challenge in dense scenarios with complex agent interactions. To address this challenge, we propose a novel collaborative(CO-) interaction-aware(-IN) MARL framework, named COIN. Specifically, we develop a new counterfactual individual-global twin delayed deep deterministic policy gradient (CIG-TD3) algorithm, crafted in a "centralized training, decentralized execution" (CTDE) manner, which aims to jointly optimize the individual objectives (navigation) and the global objectives (collaboration) of agents. We further introduce a dual-level interaction-aware centralized critic architecture that captures both local pairwise interactions and global system-level dependencies, enabling more accurate global value estimation and improved credit assignment for collaborative policy learning. We conduct extensive simulation experiments in dense urban traffic environments, which demonstrate that COIN consistently outperforms other advanced baseline methods in both safety and efficiency across various system sizes. These results highlight its superiority in complex and dynamic MASD scenarios, as further validated through real-world robot demonstrations. Supplementary videos are available at https://marmotlab.github.io/COIN/
title	COIN: Collaborative Interaction-Aware Multi-Agent Reinforcement Learning for Self-Driving Systems
topic	Robotics
url	https://arxiv.org/abs/2603.24931

Similar Items