Bibliographic Details
Main Authors: Zhang, Zuyuan; Zhou, Hanhan; Imani, Mahdi; Lee, Taeyoung; Lan, Tian
Format: Preprint
Published: 2024
Subjects: Artificial Intelligence; Multiagent Systems
Online Access: https://arxiv.org/abs/2403.15341
author Zhang, Zuyuan
Zhou, Hanhan
Imani, Mahdi
Lee, Taeyoung
Lan, Tian
contents With advances in artificial intelligence (AI), a growing number of scenarios require AI to work closely with other agents whose goals and strategies may not be known beforehand. However, existing approaches for training collaborative agents often require defined and known reward signals and cannot address the problem of teaming with unknown agents, which often have latent objectives/rewards. In response to this challenge, we propose a framework for teaming with unknown agents, which leverages a kernel density Bayesian inverse learning method for active goal deduction and uses pre-trained, goal-conditioned policies to enable zero-shot policy adaptation. We prove that unbiased reward estimates in our framework are sufficient for optimal teaming with unknown agents. We further evaluate the framework on redesigned multi-agent particle and StarCraft II micromanagement environments with diverse unknown agents that have different behaviors/rewards. Empirical results demonstrate that our framework significantly advances the teaming performance of AI and unknown agents in a wide range of collaborative scenarios.
format Preprint
id arxiv_https___arxiv_org_abs_2403_15341
institution arXiv
publishDate 2024
record_format arxiv
title Collaborative AI Teaming in Unknown Environments via Active Goal Deduction
topic Artificial Intelligence
Multiagent Systems
url https://arxiv.org/abs/2403.15341