Saved in:
Bibliographic Details
Main Authors: Jian, Pingcheng, Lee, Easop, Bell, Zachary, Zavlanos, Michael M., Chen, Boyuan
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.19971
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913630798217216
author Jian, Pingcheng
Lee, Easop
Bell, Zachary
Zavlanos, Michael M.
Chen, Boyuan
author_facet Jian, Pingcheng
Lee, Easop
Bell, Zachary
Zavlanos, Michael M.
Chen, Boyuan
contents Vision-based imitation learning has shown promising capabilities of endowing robots with various motion skills given visual observation. However, current visuomotor policies fail to adapt to drastic changes in their visual observations. We present Perception Stitching that enables strong zero-shot adaptation to large visual changes by directly stitching novel combinations of visual encoders. Our key idea is to enforce modularity of visual encoders by aligning the latent visual features among different visuomotor policies. Our method disentangles the perceptual knowledge with the downstream motion skills and allows the reuse of the visual encoders by directly stitching them to a policy network trained with partially different visual conditions. We evaluate our method in various simulated and real-world manipulation tasks. While baseline methods failed at all attempts, our method could achieve zero-shot success in real-world visuomotor tasks. Our quantitative and qualitative analysis of the learned features of the policy network provides more insights into the high performance of our proposed method.
format Preprint
id arxiv_https___arxiv_org_abs_2406_19971
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Perception Stitching: Zero-Shot Perception Encoder Transfer for Visuomotor Robot Policies
Jian, Pingcheng
Lee, Easop
Bell, Zachary
Zavlanos, Michael M.
Chen, Boyuan
Robotics
Vision-based imitation learning has shown promising capabilities of endowing robots with various motion skills given visual observation. However, current visuomotor policies fail to adapt to drastic changes in their visual observations. We present Perception Stitching that enables strong zero-shot adaptation to large visual changes by directly stitching novel combinations of visual encoders. Our key idea is to enforce modularity of visual encoders by aligning the latent visual features among different visuomotor policies. Our method disentangles the perceptual knowledge with the downstream motion skills and allows the reuse of the visual encoders by directly stitching them to a policy network trained with partially different visual conditions. We evaluate our method in various simulated and real-world manipulation tasks. While baseline methods failed at all attempts, our method could achieve zero-shot success in real-world visuomotor tasks. Our quantitative and qualitative analysis of the learned features of the policy network provides more insights into the high performance of our proposed method.
title Perception Stitching: Zero-Shot Perception Encoder Transfer for Visuomotor Robot Policies
topic Robotics
url https://arxiv.org/abs/2406.19971