תוכן הענינים: :: Library Catalog

שמור ב:

מידע ביבליוגרפי
Main Authors:	Seo, Sangwon, Unhelkar, Vaibhav
פורמט:	Preprint
יצא לאור:	2025
נושאים:	Machine Learning Artificial Intelligence Multiagent Systems
גישה מקוונת:	https://arxiv.org/abs/2502.17618
תגים:	הוספת תג אין תגיות, היה/י הראשונ/ה לתייג את הרשומה!

תוכן הענינים:

Successful collaboration requires team members to stay aligned, especially in complex sequential tasks. Team members must dynamically coordinate which subtasks to perform and in what order. However, real-world constraints like partial observability and limited communication bandwidth often lead to suboptimal collaboration. Even among expert teams, the same task can be executed in multiple ways. To develop multi-agent systems and human-AI teams for such tasks, we are interested in data-driven learning of multimodal team behaviors. Multi-Agent Imitation Learning (MAIL) provides a promising framework for data-driven learning of team behavior from demonstrations, but existing methods struggle with heterogeneous demonstrations, as they assume that all demonstrations originate from a single team policy. Hence, in this work, we introduce DTIL: a hierarchical MAIL algorithm designed to learn multimodal team behaviors in complex sequential tasks. DTIL represents each team member with a hierarchical policy and learns these policies from heterogeneous team demonstrations in a factored manner. By employing a distribution-matching approach, DTIL mitigates compounding errors and scales effectively to long horizons and continuous state representations. Experimental results show that DTIL outperforms MAIL baselines and accurately models team behavior across a variety of collaborative scenarios.

פריטים דומים