Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Gao, Fang, Li, XueTao, Yu, Jun, Shaung, Feng
Format:	Preprint
Published:	2023
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2307.11343
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910285473775616
author	Gao, Fang Li, XueTao Yu, Jun Shaung, Feng
author_facet	Gao, Fang Li, XueTao Yu, Jun Shaung, Feng
contents	The advent of Chat-GPT has led to a surge of interest in Embodied AI. However, many existing Embodied AI models heavily rely on massive interactions with training environments, which may not be practical in real-world situations. To this end, the Maniskill2 has introduced a full-physics simulation benchmark for manipulating various 3D objects. This benchmark enables agents to be trained using diverse datasets of demonstrations and evaluates their ability to generalize to unseen scenarios in testing environments. In this paper, we propose a novel two-stage fine-tuning strategy that aims to further enhance the generalization capability of our model based on the Maniskill2 benchmark. Through extensive experiments, we demonstrate the effectiveness of our approach by achieving the 1st prize in all three tracks of the ManiSkill2 Challenge. Our findings highlight the potential of our method to improve the generalization abilities of Embodied AI models and pave the way for their ractical applications in real-world scenarios. All codes and models of our solution is available at https://github.com/xtli12/GXU-LIPE.git
format	Preprint
id	arxiv_https___arxiv_org_abs_2307_11343
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	A Two-stage Fine-tuning Strategy for Generalizable Manipulation Skill of Embodied AI Gao, Fang Li, XueTao Yu, Jun Shaung, Feng Artificial Intelligence The advent of Chat-GPT has led to a surge of interest in Embodied AI. However, many existing Embodied AI models heavily rely on massive interactions with training environments, which may not be practical in real-world situations. To this end, the Maniskill2 has introduced a full-physics simulation benchmark for manipulating various 3D objects. This benchmark enables agents to be trained using diverse datasets of demonstrations and evaluates their ability to generalize to unseen scenarios in testing environments. In this paper, we propose a novel two-stage fine-tuning strategy that aims to further enhance the generalization capability of our model based on the Maniskill2 benchmark. Through extensive experiments, we demonstrate the effectiveness of our approach by achieving the 1st prize in all three tracks of the ManiSkill2 Challenge. Our findings highlight the potential of our method to improve the generalization abilities of Embodied AI models and pave the way for their ractical applications in real-world scenarios. All codes and models of our solution is available at https://github.com/xtli12/GXU-LIPE.git
title	A Two-stage Fine-tuning Strategy for Generalizable Manipulation Skill of Embodied AI
topic	Artificial Intelligence
url	https://arxiv.org/abs/2307.11343

Similar Items