Bibliographic Details
Main Authors: Gessler, Tobias, Dizdarevic, Tin, Calinescu, Ani, Ellis, Benjamin, Lupu, Andrei, Foerster, Jakob Nicolaus
Format: Preprint
Published: 2025
Subjects: Artificial Intelligence
Online Access: https://arxiv.org/abs/2503.17821
_version_ 1866908279751311360
author Gessler, Tobias
Dizdarevic, Tin
Calinescu, Ani
Ellis, Benjamin
Lupu, Andrei
Foerster, Jakob Nicolaus
contents AI agents hold the potential to transform everyday life by helping humans achieve their goals. To do this successfully, agents need to be able to coordinate with novel partners without prior interaction, a setting known as zero-shot coordination (ZSC). Overcooked has become one of the most popular benchmarks for evaluating coordination capabilities of AI agents and learning algorithms. In this work, we investigate the origins of ZSC challenges in Overcooked. We introduce a state augmentation mechanism that mixes states that might be encountered when paired with unknown partners into the training distribution, reducing the out-of-distribution challenge associated with ZSC. We show that independently trained agents under this algorithm coordinate successfully in Overcooked. Our results suggest that ZSC failure can largely be attributed to poor state coverage under self-play rather than more sophisticated coordination challenges. The Overcooked environment is therefore not suitable as a ZSC benchmark. To address these shortcomings, we introduce OvercookedV2, a new version of the benchmark, which includes asymmetric information and stochasticity, facilitating the creation of interesting ZSC scenarios. To validate OvercookedV2, we conduct experiments demonstrating that mere exhaustive state coverage is insufficient to coordinate well. Finally, we use OvercookedV2 to build a new range of coordination challenges, including ones that require test-time protocol formation, and we demonstrate the need for new coordination algorithms that can adapt online. We hope that OvercookedV2 will help benchmark the next generation of ZSC algorithms and advance collaboration between AI agents and humans.
format Preprint
id arxiv_https___arxiv_org_abs_2503_17821
institution arXiv
publishDate 2025
record_format arxiv
title OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
topic Artificial Intelligence
url https://arxiv.org/abs/2503.17821