Saved in:
Bibliographic Details
Main Authors: Erdoğan, Fatih, Barın, Merve Rabia, Güney, Fatma
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.21689
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911130180386816
author Erdoğan, Fatih
Barın, Merve Rabia
Güney, Fatma
author_facet Erdoğan, Fatih
Barın, Merve Rabia
Güney, Fatma
contents Constructing high-definition (HD) maps from sensory input requires accurately mapping the road elements in image space to the Bird's Eye View (BEV) space. The precision of this mapping directly impacts the quality of the final vectorized HD map. Existing HD mapping approaches outsource the projection to standard mapping techniques, such as attention-based ones. However, these methods struggle with accuracy due to generalization problems, often hallucinating non-existent road elements. Our key idea is to start with a geometric mapping based on camera parameters and adapt it to the scene to extract relevant map information from camera images. To implement this, we propose a novel probabilistic projection mechanism with confidence scores to (i) refine the mapping to better align with the scene and (ii) filter out irrelevant elements that should not influence HD map generation. In addition, we improve temporal processing by using confidence scores to selectively accumulate reliable information over time. Experiments on new splits of the nuScenes and Argoverse2 datasets demonstrate improved performance over state-of-the-art approaches, indicating better generalization. The improvements are particularly pronounced on nuScenes and in the challenging long perception range. Our code and model checkpoints are available at https://github.com/Fatih-Erdogan/mapping-like-skeptic .
format Preprint
id arxiv_https___arxiv_org_abs_2508_21689
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping
Erdoğan, Fatih
Barın, Merve Rabia
Güney, Fatma
Computer Vision and Pattern Recognition
Constructing high-definition (HD) maps from sensory input requires accurately mapping the road elements in image space to the Bird's Eye View (BEV) space. The precision of this mapping directly impacts the quality of the final vectorized HD map. Existing HD mapping approaches outsource the projection to standard mapping techniques, such as attention-based ones. However, these methods struggle with accuracy due to generalization problems, often hallucinating non-existent road elements. Our key idea is to start with a geometric mapping based on camera parameters and adapt it to the scene to extract relevant map information from camera images. To implement this, we propose a novel probabilistic projection mechanism with confidence scores to (i) refine the mapping to better align with the scene and (ii) filter out irrelevant elements that should not influence HD map generation. In addition, we improve temporal processing by using confidence scores to selectively accumulate reliable information over time. Experiments on new splits of the nuScenes and Argoverse2 datasets demonstrate improved performance over state-of-the-art approaches, indicating better generalization. The improvements are particularly pronounced on nuScenes and in the challenging long perception range. Our code and model checkpoints are available at https://github.com/Fatih-Erdogan/mapping-like-skeptic .
title Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2508.21689