Saved in:
Bibliographic Details
Main Authors: Wang, Yuchao, Cheng, Peirui, Tian, Pengju, Yuan, Ziyang, Zhao, Liangjin, Tian, Jing, Wang, Wensheng, Wang, Zhirui, Sun, Xian
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.04647
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909219161112576
author Wang, Yuchao
Cheng, Peirui
Tian, Pengju
Yuan, Ziyang
Zhao, Liangjin
Tian, Jing
Wang, Wensheng
Wang, Zhirui
Sun, Xian
author_facet Wang, Yuchao
Cheng, Peirui
Tian, Pengju
Yuan, Ziyang
Zhao, Liangjin
Tian, Jing
Wang, Wensheng
Wang, Zhirui
Sun, Xian
contents With the advancement of collaborative perception, the role of aerial-ground collaborative perception, a crucial component, is becoming increasingly important. The demand for collaborative perception across different perspectives to construct more comprehensive perceptual information is growing. However, challenges arise due to the disparities in the field of view between cross-domain agents and their varying sensitivity to information in images. Additionally, when we transform image features into Bird's Eye View (BEV) features for collaboration, we need accurate depth information. To address these issues, we propose a framework specifically designed for aerial-ground collaboration. First, to mitigate the lack of datasets for aerial-ground collaboration, we develop a virtual dataset named V2U-COO for our research. Second, we design a Cross-Domain Cross-Adaptation (CDCA) module to align the target information obtained from different domains, thereby achieving more accurate perception results. Finally, we introduce a Collaborative Depth Optimization (CDO) module to obtain more precise depth estimation results, leading to more accurate perception outcomes. We conduct extensive experiments on both our virtual dataset and a public dataset to validate the effectiveness of our framework. Our experiments on the V2U-COO dataset and the DAIR-V2X dataset demonstrate that our method improves detection accuracy by 6.1% and 2.7%, respectively.
format Preprint
id arxiv_https___arxiv_org_abs_2406_04647
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection
Wang, Yuchao
Cheng, Peirui
Tian, Pengju
Yuan, Ziyang
Zhao, Liangjin
Tian, Jing
Wang, Wensheng
Wang, Zhirui
Sun, Xian
Computer Vision and Pattern Recognition
With the advancement of collaborative perception, the role of aerial-ground collaborative perception, a crucial component, is becoming increasingly important. The demand for collaborative perception across different perspectives to construct more comprehensive perceptual information is growing. However, challenges arise due to the disparities in the field of view between cross-domain agents and their varying sensitivity to information in images. Additionally, when we transform image features into Bird's Eye View (BEV) features for collaboration, we need accurate depth information. To address these issues, we propose a framework specifically designed for aerial-ground collaboration. First, to mitigate the lack of datasets for aerial-ground collaboration, we develop a virtual dataset named V2U-COO for our research. Second, we design a Cross-Domain Cross-Adaptation (CDCA) module to align the target information obtained from different domains, thereby achieving more accurate perception results. Finally, we introduce a Collaborative Depth Optimization (CDO) module to obtain more precise depth estimation results, leading to more accurate perception outcomes. We conduct extensive experiments on both our virtual dataset and a public dataset to validate the effectiveness of our framework. Our experiments on the V2U-COO dataset and the DAIR-V2X dataset demonstrate that our method improves detection accuracy by 6.1% and 2.7%, respectively.
title UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2406.04647