Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wibault, Clarisse, Forkel, Johannes, Towers, Sebastian, Wibault, Tiphaine, Duque, Juan, Whittle, George, Schaab, Andreas, Yang, Yucheng, Wang, Chiyuan, Osborne, Maike, Moll, Benjamin, Foerster, Jakob
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.20141
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916060260728832
author	Wibault, Clarisse Forkel, Johannes Towers, Sebastian Wibault, Tiphaine Duque, Juan Whittle, George Schaab, Andreas Yang, Yucheng Wang, Chiyuan Osborne, Maike Moll, Benjamin Foerster, Jakob
author_facet	Wibault, Clarisse Forkel, Johannes Towers, Sebastian Wibault, Tiphaine Duque, Juan Whittle, George Schaab, Andreas Yang, Yucheng Wang, Chiyuan Osborne, Maike Moll, Benjamin Foerster, Jakob
contents	Mean Field Games (MFGs) provide a principled framework for modelling interactions in large population systems. However, algorithmic progress has been limited since model-free methods are high variance and exact methods scale poorly. Recent Hybrid Structural Methods (HSMs) reduce variance while maintaining tractability by leveraging low-dimensional individual state and action spaces and known transition dynamics to compute the exact expected return conditioned on Monte Carlo rollouts of common noise. However, HSMs have not been extended to partially observable settings. We propose Recurrent Structural Policy Gradient (RSPG), the first history-aware HSM for MFGs with public partial information. RSPG achieves an order-of-magnitude faster convergence than model-free RL methods while learning history-aware behaviour, unlike current HSMs. To facilitate research into MFGs, we also introduce MFAX, our JAX-based framework for MFGs that supports both analytic and sample-based mean-field updates. MFAX and usage examples can be found at https://clarisse-wibault.github.io/rspg/.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_20141
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Recurrent Structural Policy Gradient for Partially Observable Mean Field Games Wibault, Clarisse Forkel, Johannes Towers, Sebastian Wibault, Tiphaine Duque, Juan Whittle, George Schaab, Andreas Yang, Yucheng Wang, Chiyuan Osborne, Maike Moll, Benjamin Foerster, Jakob Artificial Intelligence Mean Field Games (MFGs) provide a principled framework for modelling interactions in large population systems. However, algorithmic progress has been limited since model-free methods are high variance and exact methods scale poorly. Recent Hybrid Structural Methods (HSMs) reduce variance while maintaining tractability by leveraging low-dimensional individual state and action spaces and known transition dynamics to compute the exact expected return conditioned on Monte Carlo rollouts of common noise. However, HSMs have not been extended to partially observable settings. We propose Recurrent Structural Policy Gradient (RSPG), the first history-aware HSM for MFGs with public partial information. RSPG achieves an order-of-magnitude faster convergence than model-free RL methods while learning history-aware behaviour, unlike current HSMs. To facilitate research into MFGs, we also introduce MFAX, our JAX-based framework for MFGs that supports both analytic and sample-based mean-field updates. MFAX and usage examples can be found at https://clarisse-wibault.github.io/rspg/.
title	Recurrent Structural Policy Gradient for Partially Observable Mean Field Games
topic	Artificial Intelligence
url	https://arxiv.org/abs/2602.20141

Similar Items