Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Deng, Yuhong, Mo, Kai, Xia, Chongkun, Wang, Xueqian
Format:	Preprint
Published:	2023
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2303.01310
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910310268403712
author	Deng, Yuhong Mo, Kai Xia, Chongkun Wang, Xueqian
author_facet	Deng, Yuhong Mo, Kai Xia, Chongkun Wang, Xueqian
contents	Multi-task learning of deformable object manipulation is a challenging problem in robot manipulation. Most previous works address this problem in a goal-conditioned way and adapt goal images to specify different tasks, which limits the multi-task learning performance and can not generalize to new tasks. Thus, we adapt language instruction to specify deformable object manipulation tasks and propose a learning framework. We first design a unified Transformer-based architecture to understand multi-modal data and output picking and placing action. Besides, we have introduced the visible connectivity graph to tackle nonlinear dynamics and complex configuration of the deformable object. Both simulated and real experiments have demonstrated that the proposed method is effective and can generalize to unseen instructions and tasks. Compared with the state-of-the-art method, our method achieves higher success rates (87.2% on average) and has a 75.6% shorter inference time. We also demonstrate that our method performs well in real-world experiments.
format	Preprint
id	arxiv_https___arxiv_org_abs_2303_01310
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Learning Language-Conditioned Deformable Object Manipulation with Graph Dynamics Deng, Yuhong Mo, Kai Xia, Chongkun Wang, Xueqian Robotics Multi-task learning of deformable object manipulation is a challenging problem in robot manipulation. Most previous works address this problem in a goal-conditioned way and adapt goal images to specify different tasks, which limits the multi-task learning performance and can not generalize to new tasks. Thus, we adapt language instruction to specify deformable object manipulation tasks and propose a learning framework. We first design a unified Transformer-based architecture to understand multi-modal data and output picking and placing action. Besides, we have introduced the visible connectivity graph to tackle nonlinear dynamics and complex configuration of the deformable object. Both simulated and real experiments have demonstrated that the proposed method is effective and can generalize to unseen instructions and tasks. Compared with the state-of-the-art method, our method achieves higher success rates (87.2% on average) and has a 75.6% shorter inference time. We also demonstrate that our method performs well in real-world experiments.
title	Learning Language-Conditioned Deformable Object Manipulation with Graph Dynamics
topic	Robotics
url	https://arxiv.org/abs/2303.01310

Similar Items