Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Che, Yuchen, Furukawa, Ryo, Kanezaki, Asako
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2408.16547
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866909942130147328
author Che, Yuchen
Furukawa, Ryo
Kanezaki, Asako
author_facet Che, Yuchen
Furukawa, Ryo
Kanezaki, Asako
contents Category-level articulated object pose estimation focuses on the pose estimation of unknown articulated objects within known categories. Despite its significance, this task remains challenging due to the varying shapes and poses of objects, expensive dataset annotation costs, and complex real-world environments. In this paper, we propose a novel self-supervised approach that leverages a single-frame point cloud to solve this task. Our model consistently generates reconstruction with a canonical pose and joint state for the entire input object, and it estimates object-level poses that reduce overall pose variance and part-level poses that align each part of the input with its corresponding part of the reconstruction. Experimental results demonstrate that our approach significantly outperforms previous self-supervised methods and is comparable to the state-of-the-art supervised methods. To assess the performance of our model in real-world scenarios, we also introduce a new real-world articulated object benchmark dataset.
format Preprint
id arxiv_https___arxiv_org_abs_2408_16547
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation
Che, Yuchen
Furukawa, Ryo
Kanezaki, Asako
Computer Vision and Pattern Recognition
Category-level articulated object pose estimation focuses on the pose estimation of unknown articulated objects within known categories. Despite its significance, this task remains challenging due to the varying shapes and poses of objects, expensive dataset annotation costs, and complex real-world environments. In this paper, we propose a novel self-supervised approach that leverages a single-frame point cloud to solve this task. Our model consistently generates reconstruction with a canonical pose and joint state for the entire input object, and it estimates object-level poses that reduce overall pose variance and part-level poses that align each part of the input with its corresponding part of the reconstruction. Experimental results demonstrate that our approach significantly outperforms previous self-supervised methods and is comparable to the state-of-the-art supervised methods. To assess the performance of our model in real-world scenarios, we also introduce a new real-world articulated object benchmark dataset.
title OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2408.16547