Saved in:
Bibliographic Details
Main Authors: Wang, Shan, Nguyen, Chuong, Liu, Jiawei, Zhang, Kaihao, Luo, Wenhan, Zhang, Yanhao, Muthu, Sundaram, Maken, Fahira Afzal, Li, Hongdong
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.07626
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909166698758144
author Wang, Shan
Nguyen, Chuong
Liu, Jiawei
Zhang, Kaihao
Luo, Wenhan
Zhang, Yanhao
Muthu, Sundaram
Maken, Fahira Afzal
Li, Hongdong
author_facet Wang, Shan
Nguyen, Chuong
Liu, Jiawei
Zhang, Kaihao
Luo, Wenhan
Zhang, Yanhao
Muthu, Sundaram
Maken, Fahira Afzal
Li, Hongdong
contents Reliable segmentation of road lines and markings is critical to autonomous driving. Our work is motivated by the observations that road lines and markings are (1) frequently occluded in the presence of moving vehicles, shadow, and glare and (2) highly structured with low intra-class shape variance and overall high appearance consistency. To solve these issues, we propose a Homography Guided Fusion (HomoFusion) module to exploit temporally-adjacent video frames for complementary cues facilitating the correct classification of the partially occluded road lines or markings. To reduce computational complexity, a novel surface normal estimator is proposed to establish spatial correspondences between the sampled frames, allowing the HomoFusion module to perform a pixel-to-pixel attention mechanism in updating the representation of the occluded road lines or markings. Experiments on ApolloScape, a large-scale lane mark segmentation dataset, and ApolloScape Night with artificial simulated night-time road conditions, demonstrate that our method outperforms other existing SOTA lane mark segmentation models with less than 9\% of their parameters and computational complexity. We show that exploiting available camera intrinsic data and ground plane assumption for cross-frame correspondence can lead to a light-weight network with significantly improved performances in speed and accuracy. We also prove the versatility of our HomoFusion approach by applying it to the problem of water puddle segmentation and achieving SOTA performance.
format Preprint
id arxiv_https___arxiv_org_abs_2404_07626
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Homography Guided Temporal Fusion for Road Line and Marking Segmentation
Wang, Shan
Nguyen, Chuong
Liu, Jiawei
Zhang, Kaihao
Luo, Wenhan
Zhang, Yanhao
Muthu, Sundaram
Maken, Fahira Afzal
Li, Hongdong
Computer Vision and Pattern Recognition
Reliable segmentation of road lines and markings is critical to autonomous driving. Our work is motivated by the observations that road lines and markings are (1) frequently occluded in the presence of moving vehicles, shadow, and glare and (2) highly structured with low intra-class shape variance and overall high appearance consistency. To solve these issues, we propose a Homography Guided Fusion (HomoFusion) module to exploit temporally-adjacent video frames for complementary cues facilitating the correct classification of the partially occluded road lines or markings. To reduce computational complexity, a novel surface normal estimator is proposed to establish spatial correspondences between the sampled frames, allowing the HomoFusion module to perform a pixel-to-pixel attention mechanism in updating the representation of the occluded road lines or markings. Experiments on ApolloScape, a large-scale lane mark segmentation dataset, and ApolloScape Night with artificial simulated night-time road conditions, demonstrate that our method outperforms other existing SOTA lane mark segmentation models with less than 9\% of their parameters and computational complexity. We show that exploiting available camera intrinsic data and ground plane assumption for cross-frame correspondence can lead to a light-weight network with significantly improved performances in speed and accuracy. We also prove the versatility of our HomoFusion approach by applying it to the problem of water puddle segmentation and achieving SOTA performance.
title Homography Guided Temporal Fusion for Road Line and Marking Segmentation
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2404.07626