Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Benjamin, Patrick, Abate, Alessandro
Format:	Preprint
Published:	2024
Subjects:	Multiagent Systems Artificial Intelligence Computer Science and Game Theory Machine Learning Systems and Control
Online Access:	https://arxiv.org/abs/2408.11607
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912780917932032
author	Benjamin, Patrick Abate, Alessandro
author_facet	Benjamin, Patrick Abate, Alessandro
contents	Recent algorithms allow decentralised agents, possibly connected via a communication network, to learn equilibria in mean-field games from a non-episodic run of the empirical system. However, these algorithms are for tabular settings: this computationally limits the size of agents' observation space, meaning the algorithms cannot handle anything but small state spaces, nor generalise beyond policies depending only on the agent's local state to so-called 'population-dependent' policies. We address this limitation by introducing function approximation to the existing setting, drawing on the Munchausen Online Mirror Descent method that has previously been employed only in finite-horizon, episodic, centralised settings. While this permits us to include the mean field in the observation for players' policies, it is unrealistic to assume decentralised agents have access to this global information: we therefore also provide new algorithms allowing agents to locally estimate the global empirical distribution, and to improve this estimate via inter-agent communication. We prove theoretically that exchanging policy information helps networked agents outperform both independent and even centralised agents in function-approximation settings. Our experiments demonstrate this happening empirically, and show that the communication network allows decentralised agents to estimate the mean field for population-dependent policies.
format	Preprint
id	arxiv_https___arxiv_org_abs_2408_11607
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation Benjamin, Patrick Abate, Alessandro Multiagent Systems Artificial Intelligence Computer Science and Game Theory Machine Learning Systems and Control Recent algorithms allow decentralised agents, possibly connected via a communication network, to learn equilibria in mean-field games from a non-episodic run of the empirical system. However, these algorithms are for tabular settings: this computationally limits the size of agents' observation space, meaning the algorithms cannot handle anything but small state spaces, nor generalise beyond policies depending only on the agent's local state to so-called 'population-dependent' policies. We address this limitation by introducing function approximation to the existing setting, drawing on the Munchausen Online Mirror Descent method that has previously been employed only in finite-horizon, episodic, centralised settings. While this permits us to include the mean field in the observation for players' policies, it is unrealistic to assume decentralised agents have access to this global information: we therefore also provide new algorithms allowing agents to locally estimate the global empirical distribution, and to improve this estimate via inter-agent communication. We prove theoretically that exchanging policy information helps networked agents outperform both independent and even centralised agents in function-approximation settings. Our experiments demonstrate this happening empirically, and show that the communication network allows decentralised agents to estimate the mean field for population-dependent policies.
title	Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation
topic	Multiagent Systems Artificial Intelligence Computer Science and Game Theory Machine Learning Systems and Control
url	https://arxiv.org/abs/2408.11607

Similar Items