Saved in:
Bibliographic Details
Main Authors: Kim, Seungku, Jang, Suhyeok, Yoon, Byungjun, Kim, Dongyoung, Won, John, Shin, Jinwoo
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.18742
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908846447919104
author Kim, Seungku
Jang, Suhyeok
Yoon, Byungjun
Kim, Dongyoung
Won, John
Shin, Jinwoo
author_facet Kim, Seungku
Jang, Suhyeok
Yoon, Byungjun
Kim, Dongyoung
Won, John
Shin, Jinwoo
contents Synthetic data generated by video generative models has shown promise for robot learning as a scalable pipeline, but it often suffers from inconsistent action quality due to imperfectly generated videos. Recently, vision-language models (VLMs) have been leveraged to validate video quality, but they have limitations in distinguishing physically accurate videos and, even then, cannot directly evaluate the generated actions themselves. To tackle this issue, we introduce RoboCurate, a novel synthetic robot data generation framework that evaluates and filters the quality of annotated actions by comparing them with simulation replay. Specifically, RoboCurate replays the predicted actions in a simulator and assesses action quality by measuring the consistency of motion between the simulator rollout and the generated video. In addition, we unlock observation diversity beyond the available dataset via image-to-image editing and apply action-preserving video-to-video transfer to further augment appearance. We observe RoboCurate's generated data yield substantial relative improvements in success rates compared to using real data only, achieving +70.1% on GR-1 Tabletop (300 demos), +16.1% on DexMimicGen in the pre-training setup, and +179.9% in the challenging real-world ALLEX humanoid dexterous manipulation setting.
format Preprint
id arxiv_https___arxiv_org_abs_2602_18742
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning
Kim, Seungku
Jang, Suhyeok
Yoon, Byungjun
Kim, Dongyoung
Won, John
Shin, Jinwoo
Robotics
Artificial Intelligence
Computer Vision and Pattern Recognition
Synthetic data generated by video generative models has shown promise for robot learning as a scalable pipeline, but it often suffers from inconsistent action quality due to imperfectly generated videos. Recently, vision-language models (VLMs) have been leveraged to validate video quality, but they have limitations in distinguishing physically accurate videos and, even then, cannot directly evaluate the generated actions themselves. To tackle this issue, we introduce RoboCurate, a novel synthetic robot data generation framework that evaluates and filters the quality of annotated actions by comparing them with simulation replay. Specifically, RoboCurate replays the predicted actions in a simulator and assesses action quality by measuring the consistency of motion between the simulator rollout and the generated video. In addition, we unlock observation diversity beyond the available dataset via image-to-image editing and apply action-preserving video-to-video transfer to further augment appearance. We observe RoboCurate's generated data yield substantial relative improvements in success rates compared to using real data only, achieving +70.1% on GR-1 Tabletop (300 demos), +16.1% on DexMimicGen in the pre-training setup, and +179.9% in the challenging real-world ALLEX humanoid dexterous manipulation setting.
title RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning
topic Robotics
Artificial Intelligence
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.18742