MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Liu, Ji, Zhang, Zifeng, Lu, Mingjie, Wei, Hongyang, Li, Dong, Xie, Yile, Peng, Jinzhang, Tian, Lu, Sirasao, Ashish, Barsoum, Emad
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Computer Vision and Pattern Recognition
Accesso online:	https://arxiv.org/abs/2404.07821
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866913310411063296
author	Liu, Ji Zhang, Zifeng Lu, Mingjie Wei, Hongyang Li, Dong Xie, Yile Peng, Jinzhang Tian, Lu Sirasao, Ashish Barsoum, Emad
author_facet	Liu, Ji Zhang, Zifeng Lu, Mingjie Wei, Hongyang Li, Dong Xie, Yile Peng, Jinzhang Tian, Lu Sirasao, Ashish Barsoum, Emad
contents	Lane detection is a fundamental task in autonomous driving, and has achieved great progress as deep learning emerges. Previous anchor-based methods often design dense anchors, which highly depend on the training dataset and remain fixed during inference. We analyze that dense anchors are not necessary for lane detection, and propose a transformer-based lane detection framework based on a sparse anchor mechanism. To this end, we generate sparse anchors with position-aware lane queries and angle queries instead of traditional explicit anchors. We adopt Horizontal Perceptual Attention (HPA) to aggregate the lane features along the horizontal direction, and adopt Lane-Angle Cross Attention (LACA) to perform interactions between lane queries and angle queries. We also propose Lane Perceptual Attention (LPA) based on deformable cross attention to further refine the lane predictions. Our method, named Sparse Laneformer, is easy-to-implement and end-to-end trainable. Extensive experiments demonstrate that Sparse Laneformer performs favorably against the state-of-the-art methods, e.g., surpassing Laneformer by 3.0% F1 score and O2SFormer by 0.7% F1 score with fewer MACs on CULane with the same ResNet-34 backbone.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_07821
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Sparse Laneformer Liu, Ji Zhang, Zifeng Lu, Mingjie Wei, Hongyang Li, Dong Xie, Yile Peng, Jinzhang Tian, Lu Sirasao, Ashish Barsoum, Emad Computer Vision and Pattern Recognition Lane detection is a fundamental task in autonomous driving, and has achieved great progress as deep learning emerges. Previous anchor-based methods often design dense anchors, which highly depend on the training dataset and remain fixed during inference. We analyze that dense anchors are not necessary for lane detection, and propose a transformer-based lane detection framework based on a sparse anchor mechanism. To this end, we generate sparse anchors with position-aware lane queries and angle queries instead of traditional explicit anchors. We adopt Horizontal Perceptual Attention (HPA) to aggregate the lane features along the horizontal direction, and adopt Lane-Angle Cross Attention (LACA) to perform interactions between lane queries and angle queries. We also propose Lane Perceptual Attention (LPA) based on deformable cross attention to further refine the lane predictions. Our method, named Sparse Laneformer, is easy-to-implement and end-to-end trainable. Extensive experiments demonstrate that Sparse Laneformer performs favorably against the state-of-the-art methods, e.g., surpassing Laneformer by 3.0% F1 score and O2SFormer by 0.7% F1 score with fewer MACs on CULane with the same ResNet-34 backbone.
title	Sparse Laneformer
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2404.07821

Documenti analoghi