Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Asl, Hamed Jabbari, Uchibe, Eiji
Format:	Preprint
Published:	2025
Subjects:	Systems and Control
Online Access:	https://arxiv.org/abs/2503.09090
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912281958285312
author	Asl, Hamed Jabbari Uchibe, Eiji
author_facet	Asl, Hamed Jabbari Uchibe, Eiji
contents	This paper introduces a novel model-free and a partially model-free algorithm for inverse optimal control (IOC), also known as inverse reinforcement learning (IRL), aimed at estimating the cost function of continuous-time nonlinear deterministic systems. Using the input-state trajectories of an expert agent, the proposed algorithms separately utilize control policy information and the Hamilton-Jacobi-Bellman equation to estimate different sets of cost function parameters. This approach allows the algorithms to achieve broader applicability while maintaining a model-free framework. Also, the model-free algorithm reduces complexity compared to existing methods, as it requires solving a forward optimal control problem only once during initialization. Furthermore, in our partially model-free algorithm, this step can be bypassed entirely for systems with known input dynamics. Simulation results demonstrate the effectiveness and efficiency of our algorithms, highlighting their potential for real-world deployment in autonomous systems and robotics.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_09090
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Data-Driven Inverse Optimal Control for Continuous-Time Nonlinear Systems Asl, Hamed Jabbari Uchibe, Eiji Systems and Control This paper introduces a novel model-free and a partially model-free algorithm for inverse optimal control (IOC), also known as inverse reinforcement learning (IRL), aimed at estimating the cost function of continuous-time nonlinear deterministic systems. Using the input-state trajectories of an expert agent, the proposed algorithms separately utilize control policy information and the Hamilton-Jacobi-Bellman equation to estimate different sets of cost function parameters. This approach allows the algorithms to achieve broader applicability while maintaining a model-free framework. Also, the model-free algorithm reduces complexity compared to existing methods, as it requires solving a forward optimal control problem only once during initialization. Furthermore, in our partially model-free algorithm, this step can be bypassed entirely for systems with known input dynamics. Simulation results demonstrate the effectiveness and efficiency of our algorithms, highlighting their potential for real-world deployment in autonomous systems and robotics.
title	Data-Driven Inverse Optimal Control for Continuous-Time Nonlinear Systems
topic	Systems and Control
url	https://arxiv.org/abs/2503.09090

Similar Items