Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Huang, Ze, Xiao, Zhongyang, Song, Mingliang, Yang, Longan, Yuan, Hongyuan, Sun, Li
Format:	Preprint
Publié:	2026
Sujets:	Computer Vision and Pattern Recognition Robotics
Accès en ligne:	https://arxiv.org/abs/2602.00810
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866914298612154368
author	Huang, Ze Xiao, Zhongyang Song, Mingliang Yang, Longan Yuan, Hongyuan Sun, Li
author_facet	Huang, Ze Xiao, Zhongyang Song, Mingliang Yang, Longan Yuan, Hongyuan Sun, Li
contents	Localization is a critical technology in autonomous driving, encompassing both topological localization, which identifies the most similar map keyframe to the current observation, and metric localization, which provides precise spatial coordinates. Conventional methods typically address these tasks independently, rely on single-camera setups, and often require additional 3D semantic or pose priors, while lacking mechanisms to quantify the confidence of localization results, making them less feasible for real industrial applications. In this paper, we propose VVLoc, a unified pipeline that employs a single neural network to concurrently achieve topological and metric vehicle localization using multi-camera system. VVLoc first evaluates the geo-proximity between visual observations, then estimates their relative metric poses using a matching strategy, while also providing a confidence measure. Additionally, the training process for VVLoc is highly efficient, requiring only pairs of visual data and corresponding ground-truth poses, eliminating the need for complex supplementary data. We evaluate VVLoc not only on the publicly available datasets, but also on a more challenging self-collected dataset, demonstrating its ability to deliver state-of-the-art localization accuracy across a wide range of localization tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_00810
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	VVLoc: Prior-free 3-DoF Vehicle Visual Localization Huang, Ze Xiao, Zhongyang Song, Mingliang Yang, Longan Yuan, Hongyuan Sun, Li Computer Vision and Pattern Recognition Robotics Localization is a critical technology in autonomous driving, encompassing both topological localization, which identifies the most similar map keyframe to the current observation, and metric localization, which provides precise spatial coordinates. Conventional methods typically address these tasks independently, rely on single-camera setups, and often require additional 3D semantic or pose priors, while lacking mechanisms to quantify the confidence of localization results, making them less feasible for real industrial applications. In this paper, we propose VVLoc, a unified pipeline that employs a single neural network to concurrently achieve topological and metric vehicle localization using multi-camera system. VVLoc first evaluates the geo-proximity between visual observations, then estimates their relative metric poses using a matching strategy, while also providing a confidence measure. Additionally, the training process for VVLoc is highly efficient, requiring only pairs of visual data and corresponding ground-truth poses, eliminating the need for complex supplementary data. We evaluate VVLoc not only on the publicly available datasets, but also on a more challenging self-collected dataset, demonstrating its ability to deliver state-of-the-art localization accuracy across a wide range of localization tasks.
title	VVLoc: Prior-free 3-DoF Vehicle Visual Localization
topic	Computer Vision and Pattern Recognition Robotics
url	https://arxiv.org/abs/2602.00810

Documents similaires