Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hu, Wei, Zhao, Yue, E, Weinan, Han, Jiequn, Long, Jihao
Format:	Preprint
Published:	2023
Subjects:	Optimization and Control Robotics
Online Access:	https://arxiv.org/abs/2311.17749
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918089484926976
author	Hu, Wei Zhao, Yue E, Weinan Han, Jiequn Long, Jihao
author_facet	Hu, Wei Zhao, Yue E, Weinan Han, Jiequn Long, Jihao
contents	This paper presents a novel approach to learning free terminal time closed-loop control for robotic manipulation tasks, enabling dynamic adjustment of task duration and control inputs to enhance performance. We extend the supervised learning approach, namely solving selected optimal open-loop problems and utilizing them as training data for a policy network, to the free terminal time scenario. Three main challenges are addressed in this extension. First, we introduce a marching scheme that enhances the solution quality and increases the success rate of the open-loop solver by gradually refining time discretization. Second, we extend the QRnet in Nakamura-Zimmerer et al. (2021b) to the free terminal time setting to address discontinuity and improve stability at the terminal state. Third, we present a more automated version of the initial value problem (IVP) enhanced sampling method from previous work (Zhang et al., 2022) to adaptively update the training dataset, significantly improving its quality. By integrating these techniques, we develop a closed-loop policy that operates effectively over a broad domain with varying optimal time durations, achieving near globally optimal total costs.
format	Preprint
id	arxiv_https___arxiv_org_abs_2311_17749
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Learning Free Terminal Time Optimal Closed-loop Control of Manipulators Hu, Wei Zhao, Yue E, Weinan Han, Jiequn Long, Jihao Optimization and Control Robotics This paper presents a novel approach to learning free terminal time closed-loop control for robotic manipulation tasks, enabling dynamic adjustment of task duration and control inputs to enhance performance. We extend the supervised learning approach, namely solving selected optimal open-loop problems and utilizing them as training data for a policy network, to the free terminal time scenario. Three main challenges are addressed in this extension. First, we introduce a marching scheme that enhances the solution quality and increases the success rate of the open-loop solver by gradually refining time discretization. Second, we extend the QRnet in Nakamura-Zimmerer et al. (2021b) to the free terminal time setting to address discontinuity and improve stability at the terminal state. Third, we present a more automated version of the initial value problem (IVP) enhanced sampling method from previous work (Zhang et al., 2022) to adaptively update the training dataset, significantly improving its quality. By integrating these techniques, we develop a closed-loop policy that operates effectively over a broad domain with varying optimal time durations, achieving near globally optimal total costs.
title	Learning Free Terminal Time Optimal Closed-loop Control of Manipulators
topic	Optimization and Control Robotics
url	https://arxiv.org/abs/2311.17749

Similar Items