Saved in:
Bibliographic Details
Main Authors: Wang, Shanshan, Xu, Haixiang, Feng, Hui, Wang, Xiaoqian, Song, Pei, Liu, Sijie, He, Jianhua
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.04835
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908304848977920
author Wang, Shanshan
Xu, Haixiang
Feng, Hui
Wang, Xiaoqian
Song, Pei
Liu, Sijie
He, Jianhua
author_facet Wang, Shanshan
Xu, Haixiang
Feng, Hui
Wang, Xiaoqian
Song, Pei
Liu, Sijie
He, Jianhua
contents The success of deep learning in intelligent ship visual perception relies heavily on rich image data. However, dedicated datasets for inland waterway vessels remain scarce, limiting the adaptability of visual perception systems in complex environments. Inland waterways, characterized by narrow channels, variable weather, and urban interference, pose significant challenges to object detection systems based on existing datasets. To address these issues, this paper introduces the Multi-environment Inland Waterway Vessel Dataset (MEIWVD), comprising 32,478 high-quality images from diverse scenarios, including sunny, rainy, foggy, and artificial lighting conditions. MEIWVD covers common vessel types in the Yangtze River Basin, emphasizing diversity, sample independence, environmental complexity, and multi-scale characteristics, making it a robust benchmark for vessel detection. Leveraging MEIWVD, this paper proposes a scene-guided image enhancement module to improve water surface images based on environmental conditions adaptively. Additionally, a parameter-limited dilated convolution enhances the representation of vessel features, while a multi-scale dilated residual fusion method integrates multi-scale features for better detection. Experiments show that MEIWVD provides a more rigorous benchmark for object detection algorithms, and the proposed methods significantly improve detector performance, especially in complex multi-environment scenarios.
format Preprint
id arxiv_https___arxiv_org_abs_2504_04835
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Inland Waterway Object Detection in Multi-environment: Dataset and Approach
Wang, Shanshan
Xu, Haixiang
Feng, Hui
Wang, Xiaoqian
Song, Pei
Liu, Sijie
He, Jianhua
Computer Vision and Pattern Recognition
The success of deep learning in intelligent ship visual perception relies heavily on rich image data. However, dedicated datasets for inland waterway vessels remain scarce, limiting the adaptability of visual perception systems in complex environments. Inland waterways, characterized by narrow channels, variable weather, and urban interference, pose significant challenges to object detection systems based on existing datasets. To address these issues, this paper introduces the Multi-environment Inland Waterway Vessel Dataset (MEIWVD), comprising 32,478 high-quality images from diverse scenarios, including sunny, rainy, foggy, and artificial lighting conditions. MEIWVD covers common vessel types in the Yangtze River Basin, emphasizing diversity, sample independence, environmental complexity, and multi-scale characteristics, making it a robust benchmark for vessel detection. Leveraging MEIWVD, this paper proposes a scene-guided image enhancement module to improve water surface images based on environmental conditions adaptively. Additionally, a parameter-limited dilated convolution enhances the representation of vessel features, while a multi-scale dilated residual fusion method integrates multi-scale features for better detection. Experiments show that MEIWVD provides a more rigorous benchmark for object detection algorithms, and the proposed methods significantly improve detector performance, especially in complex multi-environment scenarios.
title Inland Waterway Object Detection in Multi-environment: Dataset and Approach
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2504.04835