Staff View: SIME: Enhancing Policy Self-Improvement with Modal-level Exploration :: Library Catalog

Bibliographic Details
Main Authors:	Jin, Yang; Lv, Jun; Yu, Wenye; Fang, Hongjie; Li, Yong-Lu; Lu, Cewu
Format:	Preprint
Published:	2025
Subjects:	Robotics; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2505.01396

_version_	1866912358114263040
author	Jin, Yang; Lv, Jun; Yu, Wenye; Fang, Hongjie; Li, Yong-Lu; Lu, Cewu
author_facet	Jin, Yang; Lv, Jun; Yu, Wenye; Fang, Hongjie; Li, Yong-Lu; Lu, Cewu
contents	Self-improvement requires robotic systems to initially learn from human-provided data and then gradually enhance their capabilities through interaction with the environment. This is similar to how humans improve their skills through continuous practice. However, achieving effective self-improvement is challenging, primarily because robots tend to repeat their existing abilities during interactions, often failing to generate new, valuable data for learning. In this paper, we identify the key to successful self-improvement: modal-level exploration and data selection. By incorporating a modal-level exploration mechanism during policy execution, the robot can produce more diverse and multi-modal interactions. At the same time, we select the most valuable trials and high-quality segments from these interactions for learning. We successfully demonstrate effective robot self-improvement on both simulation benchmarks and real-world experiments. The capability for self-improvement will enable us to develop more robust and high-success-rate robotic control strategies at a lower cost. Our code and experiment scripts are available at https://ericjin2002.github.io/SIME/
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_01396
institution	arXiv
publishDate	2025
record_format	arxiv
title	SIME: Enhancing Policy Self-Improvement with Modal-level Exploration
topic	Robotics; Artificial Intelligence; Machine Learning
url	https://arxiv.org/abs/2505.01396

Similar Items