Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Jing, Dean, Emmanuel, Ramirez-Amaro, Karinne
Format:	Preprint
Published:	2023
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2309.14237
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916304385998848
author	Zhang, Jing Dean, Emmanuel Ramirez-Amaro, Karinne
author_facet	Zhang, Jing Dean, Emmanuel Ramirez-Amaro, Karinne
contents	Long-horizon manipulation tasks such as stacking represent a longstanding challenge in the field of robotic manipulation, particularly when using reinforcement learning (RL) methods which often struggle to learn the correct sequence of actions for achieving these complex goals. To learn this sequence, symbolic planning methods offer a good solution based on high-level reasoning, however, planners often fall short in addressing the low-level control specificity needed for precise execution. This paper introduces a novel framework that integrates symbolic planning with hierarchical RL through the cooperation of high-level operators and low-level policies. Our contribution integrates planning operators (e.g. preconditions and effects) as part of the hierarchical RL algorithm based on the Scheduled Auxiliary Control (SAC-X) method. We developed a dual-purpose high-level operator, which can be used both in holistic planning and as independent, reusable policies. Our approach offers a flexible solution for long-horizon tasks, e.g., stacking a cube. The experimental results show that our proposed method obtained an average of 97.2% success rate for learning and executing the whole stack sequence, and the success rate for learning independent policies, e.g. reach (98.9%), lift (99.7%), stack (85%), etc. The training time is also reduced by 68% when using our proposed approach.
format	Preprint
id	arxiv_https___arxiv_org_abs_2309_14237
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Hierarchical Reinforcement Learning Based on Planning Operators Zhang, Jing Dean, Emmanuel Ramirez-Amaro, Karinne Robotics Long-horizon manipulation tasks such as stacking represent a longstanding challenge in the field of robotic manipulation, particularly when using reinforcement learning (RL) methods which often struggle to learn the correct sequence of actions for achieving these complex goals. To learn this sequence, symbolic planning methods offer a good solution based on high-level reasoning, however, planners often fall short in addressing the low-level control specificity needed for precise execution. This paper introduces a novel framework that integrates symbolic planning with hierarchical RL through the cooperation of high-level operators and low-level policies. Our contribution integrates planning operators (e.g. preconditions and effects) as part of the hierarchical RL algorithm based on the Scheduled Auxiliary Control (SAC-X) method. We developed a dual-purpose high-level operator, which can be used both in holistic planning and as independent, reusable policies. Our approach offers a flexible solution for long-horizon tasks, e.g., stacking a cube. The experimental results show that our proposed method obtained an average of 97.2% success rate for learning and executing the whole stack sequence, and the success rate for learning independent policies, e.g. reach (98.9%), lift (99.7%), stack (85%), etc. The training time is also reduced by 68% when using our proposed approach.
title	Hierarchical Reinforcement Learning Based on Planning Operators
topic	Robotics
url	https://arxiv.org/abs/2309.14237

Similar Items