Salvato in:
Dettagli Bibliografici
Autori principali: Liu, Ji, Zhang, Zifeng, Lu, Mingjie, Wei, Hongyang, Li, Dong, Xie, Yile, Peng, Jinzhang, Tian, Lu, Sirasao, Ashish, Barsoum, Emad
Natura: Preprint
Pubblicazione: 2024
Soggetti:
Accesso online:https://arxiv.org/abs/2404.07821
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866913310411063296
author Liu, Ji
Zhang, Zifeng
Lu, Mingjie
Wei, Hongyang
Li, Dong
Xie, Yile
Peng, Jinzhang
Tian, Lu
Sirasao, Ashish
Barsoum, Emad
author_facet Liu, Ji
Zhang, Zifeng
Lu, Mingjie
Wei, Hongyang
Li, Dong
Xie, Yile
Peng, Jinzhang
Tian, Lu
Sirasao, Ashish
Barsoum, Emad
contents Lane detection is a fundamental task in autonomous driving, and has achieved great progress as deep learning emerges. Previous anchor-based methods often design dense anchors, which highly depend on the training dataset and remain fixed during inference. We analyze that dense anchors are not necessary for lane detection, and propose a transformer-based lane detection framework based on a sparse anchor mechanism. To this end, we generate sparse anchors with position-aware lane queries and angle queries instead of traditional explicit anchors. We adopt Horizontal Perceptual Attention (HPA) to aggregate the lane features along the horizontal direction, and adopt Lane-Angle Cross Attention (LACA) to perform interactions between lane queries and angle queries. We also propose Lane Perceptual Attention (LPA) based on deformable cross attention to further refine the lane predictions. Our method, named Sparse Laneformer, is easy-to-implement and end-to-end trainable. Extensive experiments demonstrate that Sparse Laneformer performs favorably against the state-of-the-art methods, e.g., surpassing Laneformer by 3.0% F1 score and O2SFormer by 0.7% F1 score with fewer MACs on CULane with the same ResNet-34 backbone.
format Preprint
id arxiv_https___arxiv_org_abs_2404_07821
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Sparse Laneformer
Liu, Ji
Zhang, Zifeng
Lu, Mingjie
Wei, Hongyang
Li, Dong
Xie, Yile
Peng, Jinzhang
Tian, Lu
Sirasao, Ashish
Barsoum, Emad
Computer Vision and Pattern Recognition
Lane detection is a fundamental task in autonomous driving, and has achieved great progress as deep learning emerges. Previous anchor-based methods often design dense anchors, which highly depend on the training dataset and remain fixed during inference. We analyze that dense anchors are not necessary for lane detection, and propose a transformer-based lane detection framework based on a sparse anchor mechanism. To this end, we generate sparse anchors with position-aware lane queries and angle queries instead of traditional explicit anchors. We adopt Horizontal Perceptual Attention (HPA) to aggregate the lane features along the horizontal direction, and adopt Lane-Angle Cross Attention (LACA) to perform interactions between lane queries and angle queries. We also propose Lane Perceptual Attention (LPA) based on deformable cross attention to further refine the lane predictions. Our method, named Sparse Laneformer, is easy-to-implement and end-to-end trainable. Extensive experiments demonstrate that Sparse Laneformer performs favorably against the state-of-the-art methods, e.g., surpassing Laneformer by 3.0% F1 score and O2SFormer by 0.7% F1 score with fewer MACs on CULane with the same ResNet-34 backbone.
title Sparse Laneformer
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2404.07821