Enregistré dans:
Détails bibliographiques
Auteurs principaux: Huang, Ze, Xiao, Zhongyang, Song, Mingliang, Yang, Longan, Yuan, Hongyuan, Sun, Li
Format: Preprint
Publié: 2026
Sujets:
Accès en ligne:https://arxiv.org/abs/2602.00810
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866914298612154368
author Huang, Ze
Xiao, Zhongyang
Song, Mingliang
Yang, Longan
Yuan, Hongyuan
Sun, Li
author_facet Huang, Ze
Xiao, Zhongyang
Song, Mingliang
Yang, Longan
Yuan, Hongyuan
Sun, Li
contents Localization is a critical technology in autonomous driving, encompassing both topological localization, which identifies the most similar map keyframe to the current observation, and metric localization, which provides precise spatial coordinates. Conventional methods typically address these tasks independently, rely on single-camera setups, and often require additional 3D semantic or pose priors, while lacking mechanisms to quantify the confidence of localization results, making them less feasible for real industrial applications. In this paper, we propose VVLoc, a unified pipeline that employs a single neural network to concurrently achieve topological and metric vehicle localization using multi-camera system. VVLoc first evaluates the geo-proximity between visual observations, then estimates their relative metric poses using a matching strategy, while also providing a confidence measure. Additionally, the training process for VVLoc is highly efficient, requiring only pairs of visual data and corresponding ground-truth poses, eliminating the need for complex supplementary data. We evaluate VVLoc not only on the publicly available datasets, but also on a more challenging self-collected dataset, demonstrating its ability to deliver state-of-the-art localization accuracy across a wide range of localization tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2602_00810
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle VVLoc: Prior-free 3-DoF Vehicle Visual Localization
Huang, Ze
Xiao, Zhongyang
Song, Mingliang
Yang, Longan
Yuan, Hongyuan
Sun, Li
Computer Vision and Pattern Recognition
Robotics
Localization is a critical technology in autonomous driving, encompassing both topological localization, which identifies the most similar map keyframe to the current observation, and metric localization, which provides precise spatial coordinates. Conventional methods typically address these tasks independently, rely on single-camera setups, and often require additional 3D semantic or pose priors, while lacking mechanisms to quantify the confidence of localization results, making them less feasible for real industrial applications. In this paper, we propose VVLoc, a unified pipeline that employs a single neural network to concurrently achieve topological and metric vehicle localization using multi-camera system. VVLoc first evaluates the geo-proximity between visual observations, then estimates their relative metric poses using a matching strategy, while also providing a confidence measure. Additionally, the training process for VVLoc is highly efficient, requiring only pairs of visual data and corresponding ground-truth poses, eliminating the need for complex supplementary data. We evaluate VVLoc not only on the publicly available datasets, but also on a more challenging self-collected dataset, demonstrating its ability to deliver state-of-the-art localization accuracy across a wide range of localization tasks.
title VVLoc: Prior-free 3-DoF Vehicle Visual Localization
topic Computer Vision and Pattern Recognition
Robotics
url https://arxiv.org/abs/2602.00810