Bibliographic Details
Main Authors:	Maheshwari, Chinmay; Cheng, James; Sastry, S. Shankar; Ratliff, Lillian; Mazumdar, Eric
Format:	Preprint
Published:	2023
Subjects:	Optimization and Control; Artificial Intelligence; Computer Science and Game Theory; Dynamical Systems; 91A65
Online Access:	https://arxiv.org/abs/2302.01421

Summary:

In this paper, we present an efficient algorithm for solving online Stackelberg games with multiple followers in a follower-agnostic manner. Unlike previous works, our approach works even when the leader has no knowledge of the followers' utility functions or strategy space. The algorithm introduces a unique gradient estimator that uses specially designed strategies to probe the followers. Departing from the traditional assumption of optimal play, we model the followers' responses with a convergent adaptation rule, which allows for realistic and dynamic interactions. The leader constructs the gradient estimator solely from observations of the followers' actions. We provide non-asymptotic convergence rates to stationary points of the leader's objective and demonstrate asymptotic convergence to a local Stackelberg equilibrium. To validate the algorithm, we apply it to incentive design on a large-scale transportation network, showcasing its robustness even when the leader lacks access to the followers' demand.
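
The general idea in the summary (a leader that probes followers and builds a gradient estimate only from their observed actions) can be illustrated with a toy sketch. This is not the paper's algorithm or estimator: it swaps in a plain deterministic two-point finite-difference estimate, and the single scalar follower, its adaptation rule `follower_adapt`, the leader cost `leader_cost`, and all step sizes are hypothetical choices made only for illustration.

```python
# Toy sketch only: NOT the estimator from the paper. A plain two-point
# finite-difference estimate stands in for it, and every function and
# constant below is a hypothetical choice for illustration.

def follower_adapt(y, x, steps=30, rate=0.5):
    """Hypothetical follower adaptation rule, hidden from the leader.

    The follower repeatedly moves toward 0.5 * x; the leader never sees
    this rule, only the action that results from it.
    """
    for _ in range(steps):
        y += rate * (0.5 * x - y)
    return y


def leader_cost(x, y):
    """Leader's own cost, evaluated on its action and the observed follower action."""
    return (x - 1.0) ** 2 + (y - 1.0) ** 2


def run_leader(x=0.0, y=0.0, iters=200, lr=0.1, delta=0.05):
    """Leader descends on a gradient estimate built from two probes per round."""
    for _ in range(iters):
        # Probe with x + delta and x - delta, letting the follower adapt each time.
        y_plus = follower_adapt(y, x + delta)
        y_minus = follower_adapt(y_plus, x - delta)
        # Two-point estimate of the derivative of the leader's cost,
        # using only the leader's own actions and observed follower actions.
        grad_est = (leader_cost(x + delta, y_plus)
                    - leader_cost(x - delta, y_minus)) / (2.0 * delta)
        x -= lr * grad_est
        y = y_minus
    return x


x_final = run_leader()
print(x_final)
```

In this toy setup the follower's adaptation converges to 0.5 * x, so the leader's effective objective is (x - 1)^2 + (0.5x - 1)^2, whose stationary point is x = 1.2. The leader approaches that point without ever reading the follower's rule directly.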
