Saved in:
Bibliographic Details
Main Authors: Zhang, Ruixiao, Lee, Juheon, Cai, Xiaohao, Prugel-Bennett, Adam
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.12708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911999943770112
author Zhang, Ruixiao
Lee, Juheon
Cai, Xiaohao
Prugel-Bennett, Adam
author_facet Zhang, Ruixiao
Lee, Juheon
Cai, Xiaohao
Prugel-Bennett, Adam
contents Deep learning models such as convolutional neural networks and transformers have been widely applied to solve 3D object detection problems in the domain of autonomous driving. While existing models have achieved outstanding performance on most open benchmarks, the generalization ability of these deep networks is still in doubt. To adapt models to other domains including different cities, countries, and weather, retraining with the target domain data is currently necessary, which hinders the wide application of autonomous driving. In this paper, we deeply analyze the cross-domain performance of the state-of-the-art models. We observe that most models will overfit the training domains and it is challenging to adapt them to other domains directly. Existing domain adaptation methods for 3D object detection problems are actually shifting the models' knowledge domain instead of improving their generalization ability. We then propose additional evaluation metrics -- the side-view and front-view AP -- to better analyze the core issues of the methods' heavy drops in accuracy levels. By using the proposed metrics and further evaluating the cross-domain performance in each dimension, we conclude that the overfitting problem happens more obviously on the front-view surface and the width dimension which usually faces the sensor and has more 3D points surrounding it. Meanwhile, our experiments indicate that the density of the point cloud data also significantly influences the models' cross-domain performance.
format Preprint
id arxiv_https___arxiv_org_abs_2408_12708
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection
Zhang, Ruixiao
Lee, Juheon
Cai, Xiaohao
Prugel-Bennett, Adam
Computer Vision and Pattern Recognition
Deep learning models such as convolutional neural networks and transformers have been widely applied to solve 3D object detection problems in the domain of autonomous driving. While existing models have achieved outstanding performance on most open benchmarks, the generalization ability of these deep networks is still in doubt. To adapt models to other domains including different cities, countries, and weather, retraining with the target domain data is currently necessary, which hinders the wide application of autonomous driving. In this paper, we deeply analyze the cross-domain performance of the state-of-the-art models. We observe that most models will overfit the training domains and it is challenging to adapt them to other domains directly. Existing domain adaptation methods for 3D object detection problems are actually shifting the models' knowledge domain instead of improving their generalization ability. We then propose additional evaluation metrics -- the side-view and front-view AP -- to better analyze the core issues of the methods' heavy drops in accuracy levels. By using the proposed metrics and further evaluating the cross-domain performance in each dimension, we conclude that the overfitting problem happens more obviously on the front-view surface and the width dimension which usually faces the sensor and has more 3D points surrounding it. Meanwhile, our experiments indicate that the density of the point cloud data also significantly influences the models' cross-domain performance.
title Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2408.12708