Enregistré dans:
Détails bibliographiques
Auteurs principaux: Li, Dongjiang, Peng, Bo, Li, Chang, Qiao, Ning, Zheng, Qi, Sun, Lei, Qin, Yusen, Li, Bangguo, Luan, Yifeng, Wu, Bo, Zhan, Yibing, Sun, Mingang, Xu, Tong, Li, Lusong, Shen, Hui, He, Xiaodong
Format: Preprint
Publié: 2025
Sujets:
Accès en ligne:https://arxiv.org/abs/2501.15068
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866909477096128512
author Li, Dongjiang
Peng, Bo
Li, Chang
Qiao, Ning
Zheng, Qi
Sun, Lei
Qin, Yusen
Li, Bangguo
Luan, Yifeng
Wu, Bo
Zhan, Yibing
Sun, Mingang
Xu, Tong
Li, Lusong
Shen, Hui
He, Xiaodong
author_facet Li, Dongjiang
Peng, Bo
Li, Chang
Qiao, Ning
Zheng, Qi
Sun, Lei
Qin, Yusen
Li, Bangguo
Luan, Yifeng
Wu, Bo
Zhan, Yibing
Sun, Mingang
Xu, Tong
Li, Lusong
Shen, Hui
He, Xiaodong
contents Embodied manipulation is a fundamental ability in the realm of embodied artificial intelligence. Although current embodied manipulation models show certain generalizations in specific settings, they struggle in new environments and tasks due to the complexity and diversity of real-world scenarios. The traditional end-to-end data collection and training manner leads to significant data demands. Decomposing end-to-end tasks into atomic skills helps reduce data requirements and improves the task success rate. However, existing methods are limited by predefined skill sets that cannot be dynamically updated. To address the issue, we introduce a three-wheeled data-driven method to build an atomic skill library. We divide tasks into subtasks using the Vision-Language-Planning (VLP). Then, atomic skill definitions are formed by abstracting the subtasks. Finally, an atomic skill library is constructed via data collection and Vision-Language-Action (VLA) fine-tuning. As the atomic skill library expands dynamically with the three-wheel update strategy, the range of tasks it can cover grows naturally. In this way, our method shifts focus from end-to-end tasks to atomic skills, significantly reducing data costs while maintaining high performance and enabling efficient adaptation to new tasks. Extensive experiments in real-world settings demonstrate the effectiveness and efficiency of our approach.
format Preprint
id arxiv_https___arxiv_org_abs_2501_15068
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle An Atomic Skill Library Construction Method for Data-Efficient Embodied Manipulation
Li, Dongjiang
Peng, Bo
Li, Chang
Qiao, Ning
Zheng, Qi
Sun, Lei
Qin, Yusen
Li, Bangguo
Luan, Yifeng
Wu, Bo
Zhan, Yibing
Sun, Mingang
Xu, Tong
Li, Lusong
Shen, Hui
He, Xiaodong
Robotics
Embodied manipulation is a fundamental ability in the realm of embodied artificial intelligence. Although current embodied manipulation models show certain generalizations in specific settings, they struggle in new environments and tasks due to the complexity and diversity of real-world scenarios. The traditional end-to-end data collection and training manner leads to significant data demands. Decomposing end-to-end tasks into atomic skills helps reduce data requirements and improves the task success rate. However, existing methods are limited by predefined skill sets that cannot be dynamically updated. To address the issue, we introduce a three-wheeled data-driven method to build an atomic skill library. We divide tasks into subtasks using the Vision-Language-Planning (VLP). Then, atomic skill definitions are formed by abstracting the subtasks. Finally, an atomic skill library is constructed via data collection and Vision-Language-Action (VLA) fine-tuning. As the atomic skill library expands dynamically with the three-wheel update strategy, the range of tasks it can cover grows naturally. In this way, our method shifts focus from end-to-end tasks to atomic skills, significantly reducing data costs while maintaining high performance and enabling efficient adaptation to new tasks. Extensive experiments in real-world settings demonstrate the effectiveness and efficiency of our approach.
title An Atomic Skill Library Construction Method for Data-Efficient Embodied Manipulation
topic Robotics
url https://arxiv.org/abs/2501.15068