Saved in:
Bibliographic Details
Main Authors: Li, Alan, Schoellig, Angela P.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.04279
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913610596352000
author Li, Alan
Schoellig, Angela P.
author_facet Li, Alan
Schoellig, Angela P.
contents 6D Object pose estimation is a fundamental component in robotics enabling efficient interaction with the environment. It is particularly challenging in bin-picking applications, where objects may be textureless and in difficult poses, and occlusion between objects of the same type may cause confusion even in well-trained models. We propose a novel method of hard example synthesis that is model-agnostic, using existing simulators and the modeling of pose error in both the camera-to-object viewsphere and occlusion space. Through evaluation of the model performance with respect to the distribution of object poses and occlusions, we discover regions of high error and generate realistic training samples to specifically target these regions. With our training approach, we demonstrate an improvement in correct detection rate of up to 20% across several ROBI-dataset objects using state-of-the-art pose estimation models.
format Preprint
id arxiv_https___arxiv_org_abs_2412_04279
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Targeted Hard Sample Synthesis Based on Estimated Pose and Occlusion Error for Improved Object Pose Estimation
Li, Alan
Schoellig, Angela P.
Computer Vision and Pattern Recognition
6D Object pose estimation is a fundamental component in robotics enabling efficient interaction with the environment. It is particularly challenging in bin-picking applications, where objects may be textureless and in difficult poses, and occlusion between objects of the same type may cause confusion even in well-trained models. We propose a novel method of hard example synthesis that is model-agnostic, using existing simulators and the modeling of pose error in both the camera-to-object viewsphere and occlusion space. Through evaluation of the model performance with respect to the distribution of object poses and occlusions, we discover regions of high error and generate realistic training samples to specifically target these regions. With our training approach, we demonstrate an improvement in correct detection rate of up to 20% across several ROBI-dataset objects using state-of-the-art pose estimation models.
title Targeted Hard Sample Synthesis Based on Estimated Pose and Occlusion Error for Improved Object Pose Estimation
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2412.04279