Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Mai, Jinjie, Hamdi, Abdullah, Giancola, Silvio, Zhao, Chen, Ghanem, Bernard
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2407.08023
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866929417509404672
author	Mai, Jinjie Hamdi, Abdullah Giancola, Silvio Zhao, Chen Ghanem, Bernard
author_facet	Mai, Jinjie Hamdi, Abdullah Giancola, Silvio Zhao, Chen Ghanem, Bernard
contents	We built our pipeline EgoLoc-v1, mainly inspired by EgoLoc. We propose a model ensemble strategy to improve the camera pose estimation part of the VQ3D task, which has been proven to be essential in previous work. The core idea is not only to do SfM for egocentric videos but also to do 2D-3D matching between existing 3D scans and 2D video frames. In this way, we have a hybrid SfM and camera relocalization pipeline, which can provide us with more camera poses, leading to higher QwP and overall success rate. Our method achieves the best performance regarding the most important metric, the overall success rate. We surpass previous state-of-the-art, the competitive EgoLoc, by $1.5\%$. The code is available at \url{https://github.com/Wayne-Mai/egoloc_v1}.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_08023
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization Mai, Jinjie Hamdi, Abdullah Giancola, Silvio Zhao, Chen Ghanem, Bernard Computer Vision and Pattern Recognition We built our pipeline EgoLoc-v1, mainly inspired by EgoLoc. We propose a model ensemble strategy to improve the camera pose estimation part of the VQ3D task, which has been proven to be essential in previous work. The core idea is not only to do SfM for egocentric videos but also to do 2D-3D matching between existing 3D scans and 2D video frames. In this way, we have a hybrid SfM and camera relocalization pipeline, which can provide us with more camera poses, leading to higher QwP and overall success rate. Our method achieves the best performance regarding the most important metric, the overall success rate. We surpass previous state-of-the-art, the competitive EgoLoc, by $1.5\%$. The code is available at \url{https://github.com/Wayne-Mai/egoloc_v1}.
title	Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2407.08023

Ähnliche Einträge