Saved in:
Bibliographic Details
Main Authors: Diaz, Ryan, Imdieke, Adam, Veeriah, Vivek, Desingh, Karthik
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.14968
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909714735955968
author Diaz, Ryan
Imdieke, Adam
Veeriah, Vivek
Desingh, Karthik
author_facet Diaz, Ryan
Imdieke, Adam
Veeriah, Vivek
Desingh, Karthik
contents Operating in unstructured environments like households requires robotic policies that are robust to out-of-distribution conditions. Although much work has been done in evaluating robustness for visuomotor policies, the robustness evaluation of a multisensory approach that includes force-torque sensing remains largely unexplored. This work introduces a novel, factor-based evaluation framework with the goal of assessing the robustness of multisensory policies in a peg-in-hole assembly task. To this end, we develop a multisensory policy framework utilizing the Perceiver IO architecture to learn the task. We investigate which factors pose the greatest generalization challenges in object assembly and explore a simple multisensory data augmentation technique to enhance out-of-distribution performance. We provide a simulation environment enabling controlled evaluation of these factors. Our results reveal that multisensory variations such as Grasp Pose present the most significant challenges for robustness, and naive unisensory data augmentation applied independently to each sensory modality proves insufficient to overcome them. Additionally, we find force-torque sensing to be the most informative modality for our contact-rich assembly task, with vision being the least informative. Finally, we briefly discuss supporting real-world experimental results. For additional experiments and qualitative results, we refer to the project webpage https://rpm-lab-umn.github.io/auginsert/ .
format Preprint
id arxiv_https___arxiv_org_abs_2410_14968
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle AugInsert: Learning Robust Visual-Force Policies via Data Augmentation for Object Assembly Tasks
Diaz, Ryan
Imdieke, Adam
Veeriah, Vivek
Desingh, Karthik
Robotics
Machine Learning
Operating in unstructured environments like households requires robotic policies that are robust to out-of-distribution conditions. Although much work has been done in evaluating robustness for visuomotor policies, the robustness evaluation of a multisensory approach that includes force-torque sensing remains largely unexplored. This work introduces a novel, factor-based evaluation framework with the goal of assessing the robustness of multisensory policies in a peg-in-hole assembly task. To this end, we develop a multisensory policy framework utilizing the Perceiver IO architecture to learn the task. We investigate which factors pose the greatest generalization challenges in object assembly and explore a simple multisensory data augmentation technique to enhance out-of-distribution performance. We provide a simulation environment enabling controlled evaluation of these factors. Our results reveal that multisensory variations such as Grasp Pose present the most significant challenges for robustness, and naive unisensory data augmentation applied independently to each sensory modality proves insufficient to overcome them. Additionally, we find force-torque sensing to be the most informative modality for our contact-rich assembly task, with vision being the least informative. Finally, we briefly discuss supporting real-world experimental results. For additional experiments and qualitative results, we refer to the project webpage https://rpm-lab-umn.github.io/auginsert/ .
title AugInsert: Learning Robust Visual-Force Policies via Data Augmentation for Object Assembly Tasks
topic Robotics
Machine Learning
url https://arxiv.org/abs/2410.14968