Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Xuanxi, Long, Jihao, Hu, Wei, E, Weinan, Han, Jiequn
Format:	Preprint
Published:	2022
Subjects:	Optimization and Control
Online Access:	https://arxiv.org/abs/2209.04078
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911345241227264
author	Zhang, Xuanxi Long, Jihao Hu, Wei E, Weinan Han, Jiequn
author_facet	Zhang, Xuanxi Long, Jihao Hu, Wei E, Weinan Han, Jiequn
contents	Closed-loop optimal control design for high-dimensional nonlinear systems has been a long-standing challenge. Traditional methods, such as solving the associated Hamilton-Jacobi-Bellman equation, suffer from the curse of dimensionality. Recent literature proposed a new promising approach based on supervised learning, by leveraging powerful open-loop optimal control solvers to generate training data and neural networks as efficient high-dimensional function approximators to fit the closed-loop optimal control. This approach successfully handles certain high-dimensional optimal control problems but still performs poorly on more challenging problems. One of the crucial reasons for the failure is the so-called distribution mismatch phenomenon brought by the controlled dynamics. In this paper, we investigate this phenomenon and propose the Progressive Optimal Path Sampling (POPS) method to mitigate this problem. We theoretically prove that this enhanced sampling strategy outperforms both the vanilla approach and the widely used Dataset Aggregation (DAgger) method on the classical linear-quadratic regulator by a factor proportional to the total time duration. We further numerically demonstrate that the proposed sampling strategy significantly improves the performance on tested control problems, including the optimal landing problem of a quadrotor and the optimal reaching problem of a 7 DoF manipulator.
format	Preprint
id	arxiv_https___arxiv_org_abs_2209_04078
institution	arXiv
publishDate	2022
record_format	arxiv
spellingShingle	Progressive Optimal Path Sampling for Closed-Loop Optimal Control Design with Deep Neural Networks Zhang, Xuanxi Long, Jihao Hu, Wei E, Weinan Han, Jiequn Optimization and Control Closed-loop optimal control design for high-dimensional nonlinear systems has been a long-standing challenge. Traditional methods, such as solving the associated Hamilton-Jacobi-Bellman equation, suffer from the curse of dimensionality. Recent literature proposed a new promising approach based on supervised learning, by leveraging powerful open-loop optimal control solvers to generate training data and neural networks as efficient high-dimensional function approximators to fit the closed-loop optimal control. This approach successfully handles certain high-dimensional optimal control problems but still performs poorly on more challenging problems. One of the crucial reasons for the failure is the so-called distribution mismatch phenomenon brought by the controlled dynamics. In this paper, we investigate this phenomenon and propose the Progressive Optimal Path Sampling (POPS) method to mitigate this problem. We theoretically prove that this enhanced sampling strategy outperforms both the vanilla approach and the widely used Dataset Aggregation (DAgger) method on the classical linear-quadratic regulator by a factor proportional to the total time duration. We further numerically demonstrate that the proposed sampling strategy significantly improves the performance on tested control problems, including the optimal landing problem of a quadrotor and the optimal reaching problem of a 7 DoF manipulator.
title	Progressive Optimal Path Sampling for Closed-Loop Optimal Control Design with Deep Neural Networks
topic	Optimization and Control
url	https://arxiv.org/abs/2209.04078

Similar Items