Saved in:
Bibliographic Details
Main Authors: Li, Hongyang, Li, Yang, Wang, Huijie, Zeng, Jia, Xu, Huilin, Cai, Pinlong, Chen, Li, Yan, Junchi, Xu, Feng, Xiong, Lu, Wang, Jingdong, Zhu, Futang, Xu, Chunjing, Wang, Tiancai, Xia, Fei, Mu, Beipeng, Peng, Zhihui, Lin, Dahua, Qiao, Yu
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2312.03408
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913277517234176
author Li, Hongyang
Li, Yang
Wang, Huijie
Zeng, Jia
Xu, Huilin
Cai, Pinlong
Chen, Li
Yan, Junchi
Xu, Feng
Xiong, Lu
Wang, Jingdong
Zhu, Futang
Xu, Chunjing
Wang, Tiancai
Xia, Fei
Mu, Beipeng
Peng, Zhihui
Lin, Dahua
Qiao, Yu
author_facet Li, Hongyang
Li, Yang
Wang, Huijie
Zeng, Jia
Xu, Huilin
Cai, Pinlong
Chen, Li
Yan, Junchi
Xu, Feng
Xiong, Lu
Wang, Jingdong
Zhu, Futang
Xu, Chunjing
Wang, Tiancai
Xia, Fei
Mu, Beipeng
Peng, Zhihui
Lin, Dahua
Qiao, Yu
contents With the continuous maturation and application of autonomous driving technology, a systematic examination of open-source autonomous driving datasets becomes instrumental in fostering the robust evolution of the industry ecosystem. Current autonomous driving datasets can broadly be categorized into two generations. The first-generation autonomous driving datasets are characterized by relatively simpler sensor modalities, smaller data scale, and is limited to perception-level tasks. KITTI, introduced in 2012, serves as a prominent representative of this initial wave. In contrast, the second-generation datasets exhibit heightened complexity in sensor modalities, greater data scale and diversity, and an expansion of tasks from perception to encompass prediction and control. Leading examples of the second generation include nuScenes and Waymo, introduced around 2019. This comprehensive review, conducted in collaboration with esteemed colleagues from both academia and industry, systematically assesses over seventy open-source autonomous driving datasets from domestic and international sources. It offers insights into various aspects, such as the principles underlying the creation of high-quality datasets, the pivotal role of data engine systems, and the utilization of generative foundation models to facilitate scalable data generation. Furthermore, this review undertakes an exhaustive analysis and discourse regarding the characteristics and data scales that future third-generation autonomous driving datasets should possess. It also delves into the scientific and technical challenges that warrant resolution. These endeavors are pivotal in advancing autonomous innovation and fostering technological enhancement in critical domains. For further details, please refer to https://github.com/OpenDriveLab/DriveAGI.
format Preprint
id arxiv_https___arxiv_org_abs_2312_03408
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future
Li, Hongyang
Li, Yang
Wang, Huijie
Zeng, Jia
Xu, Huilin
Cai, Pinlong
Chen, Li
Yan, Junchi
Xu, Feng
Xiong, Lu
Wang, Jingdong
Zhu, Futang
Xu, Chunjing
Wang, Tiancai
Xia, Fei
Mu, Beipeng
Peng, Zhihui
Lin, Dahua
Qiao, Yu
Computer Vision and Pattern Recognition
With the continuous maturation and application of autonomous driving technology, a systematic examination of open-source autonomous driving datasets becomes instrumental in fostering the robust evolution of the industry ecosystem. Current autonomous driving datasets can broadly be categorized into two generations. The first-generation autonomous driving datasets are characterized by relatively simpler sensor modalities, smaller data scale, and is limited to perception-level tasks. KITTI, introduced in 2012, serves as a prominent representative of this initial wave. In contrast, the second-generation datasets exhibit heightened complexity in sensor modalities, greater data scale and diversity, and an expansion of tasks from perception to encompass prediction and control. Leading examples of the second generation include nuScenes and Waymo, introduced around 2019. This comprehensive review, conducted in collaboration with esteemed colleagues from both academia and industry, systematically assesses over seventy open-source autonomous driving datasets from domestic and international sources. It offers insights into various aspects, such as the principles underlying the creation of high-quality datasets, the pivotal role of data engine systems, and the utilization of generative foundation models to facilitate scalable data generation. Furthermore, this review undertakes an exhaustive analysis and discourse regarding the characteristics and data scales that future third-generation autonomous driving datasets should possess. It also delves into the scientific and technical challenges that warrant resolution. These endeavors are pivotal in advancing autonomous innovation and fostering technological enhancement in critical domains. For further details, please refer to https://github.com/OpenDriveLab/DriveAGI.
title Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2312.03408