Saved in:
Bibliographic Details
Main Authors: Wu, Cho-Ying, Hu, Xiaoyan, Happold, Michael, Xu, Qiangeng, Neumann, Ulrich
Format: Preprint
Published: 2020
Subjects:
Online Access:https://arxiv.org/abs/2006.07802
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911758989393920
author Wu, Cho-Ying
Hu, Xiaoyan
Happold, Michael
Xu, Qiangeng
Neumann, Ulrich
author_facet Wu, Cho-Ying
Hu, Xiaoyan
Happold, Michael
Xu, Qiangeng
Neumann, Ulrich
contents Most previous works of outdoor instance segmentation for images only use color information. We explore a novel direction of sensor fusion to exploit stereo cameras. Geometric information from disparities helps separate overlapping objects of the same or different classes. Moreover, geometric information penalizes region proposals with unlikely 3D shapes thus suppressing false positive detections. Mask regression is based on 2D, 2.5D, and 3D ROI using the pseudo-lidar and image-based representations. These mask predictions are fused by a mask scoring process. However, public datasets only adopt stereo systems with shorter baseline and focal legnth, which limit measuring ranges of stereo cameras. We collect and utilize High-Quality Driving Stereo (HQDS) dataset, using much longer baseline and focal length with higher resolution. Our performance attains state of the art. Please refer to our project page. The full paper is available here.
format Preprint
id arxiv_https___arxiv_org_abs_2006_07802
institution arXiv
publishDate 2020
record_format arxiv
spellingShingle Geometry-Aware Instance Segmentation with Disparity Maps
Wu, Cho-Ying
Hu, Xiaoyan
Happold, Michael
Xu, Qiangeng
Neumann, Ulrich
Computer Vision and Pattern Recognition
Most previous works of outdoor instance segmentation for images only use color information. We explore a novel direction of sensor fusion to exploit stereo cameras. Geometric information from disparities helps separate overlapping objects of the same or different classes. Moreover, geometric information penalizes region proposals with unlikely 3D shapes thus suppressing false positive detections. Mask regression is based on 2D, 2.5D, and 3D ROI using the pseudo-lidar and image-based representations. These mask predictions are fused by a mask scoring process. However, public datasets only adopt stereo systems with shorter baseline and focal legnth, which limit measuring ranges of stereo cameras. We collect and utilize High-Quality Driving Stereo (HQDS) dataset, using much longer baseline and focal length with higher resolution. Our performance attains state of the art. Please refer to our project page. The full paper is available here.
title Geometry-Aware Instance Segmentation with Disparity Maps
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2006.07802