Saved in:
| Main Authors: | Gessler, Tobias, Dizdarevic, Tin, Calinescu, Ani, Ellis, Benjamin, Lupu, Andrei, Foerster, Jakob Nicolaus |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.17821 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Ad-Hoc Human-AI Coordination Challenge
by: Dizdarević, Tin, et al.
Published: (2025)
by: Dizdarević, Tin, et al.
Published: (2025)
Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative Agents
by: Sun, Haochen, et al.
Published: (2025)
by: Sun, Haochen, et al.
Published: (2025)
CURATe: Benchmarking Personalised Alignment of Conversational AI Assistants
by: Alberts, Lize, et al.
Published: (2024)
by: Alberts, Lize, et al.
Published: (2024)
The Overcooked Generalisation Challenge: Evaluating Cooperation with Novel Partners in Unknown Environments Using Unsupervised Environment Design
by: Ruhdorfer, Constantin, et al.
Published: (2024)
by: Ruhdorfer, Constantin, et al.
Published: (2024)
The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind
by: Lupu, Andrei, et al.
Published: (2025)
by: Lupu, Andrei, et al.
Published: (2025)
Behaviour Distillation
by: Lupu, Andrei, et al.
Published: (2024)
by: Lupu, Andrei, et al.
Published: (2024)
DéjàQ: Open-Ended Evolution of Diverse, Learnable and Verifiable Problems
by: Röpke, Willem, et al.
Published: (2026)
by: Röpke, Willem, et al.
Published: (2026)
Beyond the Boundaries of Proximal Policy Optimization
by: Tan, Charlie B., et al.
Published: (2024)
by: Tan, Charlie B., et al.
Published: (2024)
AI & Human Co-Improvement for Safer Co-Superintelligence
by: Weston, Jason, et al.
Published: (2025)
by: Weston, Jason, et al.
Published: (2025)
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem
by: Barde, Paul, et al.
Published: (2023)
by: Barde, Paul, et al.
Published: (2023)
Policy-Guided Diffusion
by: Jackson, Matthew Thomas, et al.
Published: (2024)
by: Jackson, Matthew Thomas, et al.
Published: (2024)
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection
by: Nasvytis, Linas, et al.
Published: (2024)
by: Nasvytis, Linas, et al.
Published: (2024)
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games
by: Anwar, Usman, et al.
Published: (2024)
by: Anwar, Usman, et al.
Published: (2024)
Automatic Curriculum Design for Zero-Shot Human-AI Coordination
by: You, Won-Sang, et al.
Published: (2025)
by: You, Won-Sang, et al.
Published: (2025)
JaxUED: A simple and useable UED library in Jax
by: Coward, Samuel, et al.
Published: (2024)
by: Coward, Samuel, et al.
Published: (2024)
How Should We Meta-Learn Reinforcement Learning Algorithms?
by: Goldie, Alexander David, et al.
Published: (2025)
by: Goldie, Alexander David, et al.
Published: (2025)
Meta-Learning Objectives for Preference Optimization
by: Alfano, Carlo, et al.
Published: (2024)
by: Alfano, Carlo, et al.
Published: (2024)
Efficient Reinforcement Learning for Zero-Shot Coordination in Evolving Games
by: Hui, Bingyu, et al.
Published: (2025)
by: Hui, Bingyu, et al.
Published: (2025)
Emergent Risk Awareness in Rational Agents under Resource Constraints
by: Ornia, Daniel Jarne, et al.
Published: (2025)
by: Ornia, Daniel Jarne, et al.
Published: (2025)
Shaping Zero-Shot Coordination via State Blocking
by: Kang, Mingu, et al.
Published: (2026)
by: Kang, Mingu, et al.
Published: (2026)
Asking the Right Questions: Improving Reasoning with Generated Stepping Stones
by: Hu, Hengyuan, et al.
Published: (2026)
by: Hu, Hengyuan, et al.
Published: (2026)
A Clean Slate for Offline Reinforcement Learning
by: Jackson, Matthew Thomas, et al.
Published: (2025)
by: Jackson, Matthew Thomas, et al.
Published: (2025)
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
by: Li, Yang, et al.
Published: (2023)
by: Li, Yang, et al.
Published: (2023)
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
by: Samvelyan, Mikayel, et al.
Published: (2024)
by: Samvelyan, Mikayel, et al.
Published: (2024)
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps
by: Ellis, Benjamin, et al.
Published: (2024)
by: Ellis, Benjamin, et al.
Published: (2024)
Can Learned Optimization Make Reinforcement Learning Less Difficult?
by: Goldie, Alexander David, et al.
Published: (2024)
by: Goldie, Alexander David, et al.
Published: (2024)
PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition
by: Zhang, Ziyang, et al.
Published: (2024)
by: Zhang, Ziyang, et al.
Published: (2024)
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
by: Rutherford, Alexander, et al.
Published: (2023)
by: Rutherford, Alexander, et al.
Published: (2023)
AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement
by: Rosser, J, et al.
Published: (2025)
by: Rosser, J, et al.
Published: (2025)
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
by: Matthews, Michael, et al.
Published: (2024)
by: Matthews, Michael, et al.
Published: (2024)
Optimizing Split Learning Latency in TinyML-Based IoT Systems
by: Jenhani, Zied, et al.
Published: (2025)
by: Jenhani, Zied, et al.
Published: (2025)
Towards Zero-Shot Coordination between Teams of Agents: The N-XPlay Framework
by: Abderezaei, Ava, et al.
Published: (2025)
by: Abderezaei, Ava, et al.
Published: (2025)
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
by: Xue, Ke, et al.
Published: (2022)
by: Xue, Ke, et al.
Published: (2022)
Zero-Shot Robustification of Zero-Shot Models
by: Adila, Dyah, et al.
Published: (2023)
by: Adila, Dyah, et al.
Published: (2023)
JaxLife: An Open-Ended Agentic Simulator
by: Lu, Chris, et al.
Published: (2024)
by: Lu, Chris, et al.
Published: (2024)
Discovering Temporally-Aware Reinforcement Learning Algorithms
by: Jackson, Matthew Thomas, et al.
Published: (2024)
by: Jackson, Matthew Thomas, et al.
Published: (2024)
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
by: Sims, Anya, et al.
Published: (2024)
by: Sims, Anya, et al.
Published: (2024)
Mirror Learning: A Unifying Framework of Policy Optimisation
by: Kuba, Jakub Grudzien, et al.
Published: (2022)
by: Kuba, Jakub Grudzien, et al.
Published: (2022)
Learning Multi-Agent Communication with Contrastive Learning
by: Lo, Yat Long, et al.
Published: (2023)
by: Lo, Yat Long, et al.
Published: (2023)
Equivariant Networks for Zero-Shot Coordination
by: Muglich, Darius, et al.
Published: (2022)
by: Muglich, Darius, et al.
Published: (2022)
Similar Items
-
Ad-Hoc Human-AI Coordination Challenge
by: Dizdarević, Tin, et al.
Published: (2025) -
Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative Agents
by: Sun, Haochen, et al.
Published: (2025) -
CURATe: Benchmarking Personalised Alignment of Conversational AI Assistants
by: Alberts, Lize, et al.
Published: (2024) -
The Overcooked Generalisation Challenge: Evaluating Cooperation with Novel Partners in Unknown Environments Using Unsupervised Environment Design
by: Ruhdorfer, Constantin, et al.
Published: (2024) -
The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind
by: Lupu, Andrei, et al.
Published: (2025)