Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Martirosyan, Emin, Cao, Ming
Format:	Preprint
Published:	2024
Subjects:	Optimization and Control Dynamical Systems
Online Access:	https://arxiv.org/abs/2403.02146
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913562720468992
author	Martirosyan, Emin Cao, Ming
author_facet	Martirosyan, Emin Cao, Ming
contents	In this paper, we address the inverse problem for linear-quadratic differential non-cooperative games with output-feedback. Given players' stabilizing feedback laws, the goal is to find cost function parameters that lead to a game for which the observed game dynamics are at a Nash equilibrium. Using the given feedback laws, we introduce a model-based algorithm that generates cost function parameters solving the above inverse problem. We introduce a correction procedure that at each iteration of the algorithm guarantees the existence of the feedback laws, which addresses a key challenge of output-feedback control designs. As an intermediate stage of the algorithm, we have developed a procedure for the initial stabilization of the multiple-input system with output-feedback information structure. We prove convergence and stability of the algorithm, and show the way to generate new games with necessary properties without requiring to run the complete algorithm repeatedly. Then the algorithm is extended to a model-free version that uses data samples generated by unknown dynamics and has the same converging and stabilizing properties as the model-based version. Finally, we show how the inverse problem can be solved in a distributed manner and provide possible extensions. Simulation results validate the effectiveness of the proposed algorithms.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_02146
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Reinforcement Learning for Inverse Non-Cooperative Linear-Quadratic Output-feedback Differential Games Martirosyan, Emin Cao, Ming Optimization and Control Dynamical Systems In this paper, we address the inverse problem for linear-quadratic differential non-cooperative games with output-feedback. Given players' stabilizing feedback laws, the goal is to find cost function parameters that lead to a game for which the observed game dynamics are at a Nash equilibrium. Using the given feedback laws, we introduce a model-based algorithm that generates cost function parameters solving the above inverse problem. We introduce a correction procedure that at each iteration of the algorithm guarantees the existence of the feedback laws, which addresses a key challenge of output-feedback control designs. As an intermediate stage of the algorithm, we have developed a procedure for the initial stabilization of the multiple-input system with output-feedback information structure. We prove convergence and stability of the algorithm, and show the way to generate new games with necessary properties without requiring to run the complete algorithm repeatedly. Then the algorithm is extended to a model-free version that uses data samples generated by unknown dynamics and has the same converging and stabilizing properties as the model-based version. Finally, we show how the inverse problem can be solved in a distributed manner and provide possible extensions. Simulation results validate the effectiveness of the proposed algorithms.
title	Reinforcement Learning for Inverse Non-Cooperative Linear-Quadratic Output-feedback Differential Games
topic	Optimization and Control Dynamical Systems
url	https://arxiv.org/abs/2403.02146

Similar Items