Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Jiang, Zhou, Zhu, Zhenxin, Li, Pengfei, Gao, Huan-ang, Yuan, Tianyuan, Shi, Yongliang, Zhao, Hang, Zhao, Hao
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.10521
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916183453728768
author	Jiang, Zhou Zhu, Zhenxin Li, Pengfei Gao, Huan-ang Yuan, Tianyuan Shi, Yongliang Zhao, Hang Zhao, Hao
author_facet	Jiang, Zhou Zhu, Zhenxin Li, Pengfei Gao, Huan-ang Yuan, Tianyuan Shi, Yongliang Zhao, Hang Zhao, Hao
contents	Autonomous vehicles are gradually entering city roads today, with the help of high-definition maps (HDMaps). However, the reliance on HDMaps prevents autonomous vehicles from stepping into regions without this expensive digital infrastructure. This fact drives many researchers to study online HDMap generation algorithms, but the performance of these algorithms at far regions is still unsatisfying. We present P-MapNet, in which the letter P highlights the fact that we focus on incorporating map priors to improve model performance. Specifically, we exploit priors in both SDMap and HDMap. On one hand, we extract weakly aligned SDMap from OpenStreetMap, and encode it as an additional conditioning branch. Despite the misalignment challenge, our attention-based architecture adaptively attends to relevant SDMap skeletons and significantly improves performance. On the other hand, we exploit a masked autoencoder to capture the prior distribution of HDMap, which can serve as a refinement module to mitigate occlusions and artifacts. We benchmark on the nuScenes and Argoverse2 datasets. Through comprehensive experiments, we show that: (1) our SDMap prior can improve online map generation performance, using both rasterized (by up to $+18.73$ $\rm mIoU$) and vectorized (by up to $+8.50$ $\rm mAP$) output representations. (2) our HDMap prior can improve map perceptual metrics by up to $6.34\%$. (3) P-MapNet can be switched into different inference modes that covers different regions of the accuracy-efficiency trade-off landscape. (4) P-MapNet is a far-seeing solution that brings larger improvements on longer ranges. Codes and models are publicly available at https://jike5.github.io/P-MapNet.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_10521
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors Jiang, Zhou Zhu, Zhenxin Li, Pengfei Gao, Huan-ang Yuan, Tianyuan Shi, Yongliang Zhao, Hang Zhao, Hao Computer Vision and Pattern Recognition Autonomous vehicles are gradually entering city roads today, with the help of high-definition maps (HDMaps). However, the reliance on HDMaps prevents autonomous vehicles from stepping into regions without this expensive digital infrastructure. This fact drives many researchers to study online HDMap generation algorithms, but the performance of these algorithms at far regions is still unsatisfying. We present P-MapNet, in which the letter P highlights the fact that we focus on incorporating map priors to improve model performance. Specifically, we exploit priors in both SDMap and HDMap. On one hand, we extract weakly aligned SDMap from OpenStreetMap, and encode it as an additional conditioning branch. Despite the misalignment challenge, our attention-based architecture adaptively attends to relevant SDMap skeletons and significantly improves performance. On the other hand, we exploit a masked autoencoder to capture the prior distribution of HDMap, which can serve as a refinement module to mitigate occlusions and artifacts. We benchmark on the nuScenes and Argoverse2 datasets. Through comprehensive experiments, we show that: (1) our SDMap prior can improve online map generation performance, using both rasterized (by up to $+18.73$ $\rm mIoU$) and vectorized (by up to $+8.50$ $\rm mAP$) output representations. (2) our HDMap prior can improve map perceptual metrics by up to $6.34\%$. (3) P-MapNet can be switched into different inference modes that covers different regions of the accuracy-efficiency trade-off landscape. (4) P-MapNet is a far-seeing solution that brings larger improvements on longer ranges. Codes and models are publicly available at https://jike5.github.io/P-MapNet.
title	P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2403.10521

Similar Items