Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Ruixiao, Lee, Juheon, Cai, Xiaohao, Prugel-Bennett, Adam
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2408.12708
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911999943770112
author	Zhang, Ruixiao Lee, Juheon Cai, Xiaohao Prugel-Bennett, Adam
author_facet	Zhang, Ruixiao Lee, Juheon Cai, Xiaohao Prugel-Bennett, Adam
contents	Deep learning models such as convolutional neural networks and transformers have been widely applied to solve 3D object detection problems in the domain of autonomous driving. While existing models have achieved outstanding performance on most open benchmarks, the generalization ability of these deep networks is still in doubt. To adapt models to other domains including different cities, countries, and weather, retraining with the target domain data is currently necessary, which hinders the wide application of autonomous driving. In this paper, we deeply analyze the cross-domain performance of the state-of-the-art models. We observe that most models will overfit the training domains and it is challenging to adapt them to other domains directly. Existing domain adaptation methods for 3D object detection problems are actually shifting the models' knowledge domain instead of improving their generalization ability. We then propose additional evaluation metrics -- the side-view and front-view AP -- to better analyze the core issues of the methods' heavy drops in accuracy levels. By using the proposed metrics and further evaluating the cross-domain performance in each dimension, we conclude that the overfitting problem happens more obviously on the front-view surface and the width dimension which usually faces the sensor and has more 3D points surrounding it. Meanwhile, our experiments indicate that the density of the point cloud data also significantly influences the models' cross-domain performance.
format	Preprint
id	arxiv_https___arxiv_org_abs_2408_12708
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection Zhang, Ruixiao Lee, Juheon Cai, Xiaohao Prugel-Bennett, Adam Computer Vision and Pattern Recognition Deep learning models such as convolutional neural networks and transformers have been widely applied to solve 3D object detection problems in the domain of autonomous driving. While existing models have achieved outstanding performance on most open benchmarks, the generalization ability of these deep networks is still in doubt. To adapt models to other domains including different cities, countries, and weather, retraining with the target domain data is currently necessary, which hinders the wide application of autonomous driving. In this paper, we deeply analyze the cross-domain performance of the state-of-the-art models. We observe that most models will overfit the training domains and it is challenging to adapt them to other domains directly. Existing domain adaptation methods for 3D object detection problems are actually shifting the models' knowledge domain instead of improving their generalization ability. We then propose additional evaluation metrics -- the side-view and front-view AP -- to better analyze the core issues of the methods' heavy drops in accuracy levels. By using the proposed metrics and further evaluating the cross-domain performance in each dimension, we conclude that the overfitting problem happens more obviously on the front-view surface and the width dimension which usually faces the sensor and has more 3D points surrounding it. Meanwhile, our experiments indicate that the density of the point cloud data also significantly influences the models' cross-domain performance.
title	Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2408.12708

Similar Items