Saved in:
Bibliographic Details
Main Authors: Wibault, Clarisse, Forkel, Johannes, Towers, Sebastian, Wibault, Tiphaine, Duque, Juan, Whittle, George, Schaab, Andreas, Yang, Yucheng, Wang, Chiyuan, Osborne, Maike, Moll, Benjamin, Foerster, Jakob
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.20141
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916060260728832
author Wibault, Clarisse
Forkel, Johannes
Towers, Sebastian
Wibault, Tiphaine
Duque, Juan
Whittle, George
Schaab, Andreas
Yang, Yucheng
Wang, Chiyuan
Osborne, Maike
Moll, Benjamin
Foerster, Jakob
author_facet Wibault, Clarisse
Forkel, Johannes
Towers, Sebastian
Wibault, Tiphaine
Duque, Juan
Whittle, George
Schaab, Andreas
Yang, Yucheng
Wang, Chiyuan
Osborne, Maike
Moll, Benjamin
Foerster, Jakob
contents Mean Field Games (MFGs) provide a principled framework for modelling interactions in large population systems. However, algorithmic progress has been limited since model-free methods are high variance and exact methods scale poorly. Recent Hybrid Structural Methods (HSMs) reduce variance while maintaining tractability by leveraging low-dimensional individual state and action spaces and known transition dynamics to compute the exact expected return conditioned on Monte Carlo rollouts of common noise. However, HSMs have not been extended to partially observable settings. We propose Recurrent Structural Policy Gradient (RSPG), the first history-aware HSM for MFGs with public partial information. RSPG achieves an order-of-magnitude faster convergence than model-free RL methods while learning history-aware behaviour, unlike current HSMs. To facilitate research into MFGs, we also introduce MFAX, our JAX-based framework for MFGs that supports both analytic and sample-based mean-field updates. MFAX and usage examples can be found at https://clarisse-wibault.github.io/rspg/.
format Preprint
id arxiv_https___arxiv_org_abs_2602_20141
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Recurrent Structural Policy Gradient for Partially Observable Mean Field Games
Wibault, Clarisse
Forkel, Johannes
Towers, Sebastian
Wibault, Tiphaine
Duque, Juan
Whittle, George
Schaab, Andreas
Yang, Yucheng
Wang, Chiyuan
Osborne, Maike
Moll, Benjamin
Foerster, Jakob
Artificial Intelligence
Mean Field Games (MFGs) provide a principled framework for modelling interactions in large population systems. However, algorithmic progress has been limited since model-free methods are high variance and exact methods scale poorly. Recent Hybrid Structural Methods (HSMs) reduce variance while maintaining tractability by leveraging low-dimensional individual state and action spaces and known transition dynamics to compute the exact expected return conditioned on Monte Carlo rollouts of common noise. However, HSMs have not been extended to partially observable settings. We propose Recurrent Structural Policy Gradient (RSPG), the first history-aware HSM for MFGs with public partial information. RSPG achieves an order-of-magnitude faster convergence than model-free RL methods while learning history-aware behaviour, unlike current HSMs. To facilitate research into MFGs, we also introduce MFAX, our JAX-based framework for MFGs that supports both analytic and sample-based mean-field updates. MFAX and usage examples can be found at https://clarisse-wibault.github.io/rspg/.
title Recurrent Structural Policy Gradient for Partially Observable Mean Field Games
topic Artificial Intelligence
url https://arxiv.org/abs/2602.20141