Bibliographic Details
Main Authors:	Sun, Weihao; Xu, Gehui; Moreschini, Alessio; Parisini, Thomas; Malikopoulos, Andreas A.
Format:	Preprint
Published:	2026
Subjects:	Systems and Control
Online Access:	https://arxiv.org/abs/2604.01056

Summary:

In this paper, we develop a kernel-based policy iteration functional learning framework for computing team-optimal strategies in traffic coordination problems. We consider a multi-agent discrete-time linear system with a cost function that combines quadratic regulation terms and nonlinear safety penalties. Building on the Hilbert space formulation of offline receding-horizon policy iteration, we seek approximate solutions within a reproducing kernel Hilbert space, where the policy improvement step is implemented via a discrete Fréchet derivative. We further study the model-free receding-horizon scenario, in which the system dynamics are estimated using recursive least squares and the policy is then updated using rolling online data. The proposed method is tested in signal-free intersection scenarios via both model-based and model-free simulations and validated in SUMO.
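
The model-free step mentioned in the summary estimates linear dynamics by recursive least squares. The following is a minimal sketch of that general technique, not the authors' implementation: it assumes a model of the form x_next = A x + B u, noise-free data, and a double-integrator example chosen here purely for illustration. The function names `rls_identify` and `simulate`, the forgetting factor, and the initial covariance scale `p0` are all hypothetical choices.

```python
import random


def rls_identify(samples, n_state, n_input, forgetting=1.0, p0=1e6):
    """Recursively estimate Theta = [A B] in x_next = A x + B u (assumed model)."""
    d = n_state + n_input
    theta = [[0.0] * d for _ in range(n_state)]  # current estimate of [A B]
    # Large initial covariance: little trust in the all-zero initial guess.
    P = [[p0 if i == j else 0.0 for j in range(d)] for i in range(d)]
    for x, u, x_next in samples:
        z = list(x) + list(u)  # regressor [x; u]
        Pz = [sum(P[i][j] * z[j] for j in range(d)) for i in range(d)]
        denom = forgetting + sum(z[i] * Pz[i] for i in range(d))
        gain = [v / denom for v in Pz]
        # Correct each row of Theta by its one-step prediction error.
        for r in range(n_state):
            err = x_next[r] - sum(theta[r][j] * z[j] for j in range(d))
            for j in range(d):
                theta[r][j] += err * gain[j]
        # Covariance update; P is symmetric, so z^T P equals (P z)^T.
        P = [[(P[i][j] - gain[i] * Pz[j]) / forgetting for j in range(d)]
             for i in range(d)]
    A_hat = [row[:n_state] for row in theta]
    B_hat = [row[n_state:] for row in theta]
    return A_hat, B_hat


def simulate(A, B, steps, seed=0):
    """Roll out x_next = A x + B u from the origin with random inputs."""
    rng = random.Random(seed)
    n, m = len(A), len(B[0])
    x = [0.0] * n
    samples = []
    for _ in range(steps):
        u = [rng.uniform(-1.0, 1.0) for _ in range(m)]
        x_next = [sum(A[i][j] * x[j] for j in range(n))
                  + sum(B[i][j] * u[j] for j in range(m)) for i in range(n)]
        samples.append((x, u, x_next))
        x = x_next
    return samples


# Illustrative double integrator with a 0.1 s sampling time (an assumption).
A_true = [[1.0, 0.1], [0.0, 1.0]]
B_true = [[0.005], [0.1]]
data = simulate(A_true, B_true, steps=200)
A_hat, B_hat = rls_identify(data, n_state=2, n_input=1)
print(A_hat, B_hat)
```

With noise-free data and sufficiently exciting inputs, the estimates should approach the true matrices. A forgetting factor below 1 would weight recent samples more heavily, which may suit a rolling-data setting, at the cost of higher sensitivity to noise.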
