Saved in:
Bibliographic Details
Main Authors: Wang, Yuxiang, Bai, Xuecheng, Hu, Boyu, Xu, Chuanzhi, Chen, Haodong, Chung, Vera, Li, Tingxue, Chen, Xiaoming
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.12697
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909735948648448
author Wang, Yuxiang
Bai, Xuecheng
Hu, Boyu
Xu, Chuanzhi
Chen, Haodong
Chung, Vera
Li, Tingxue
Chen, Xiaoming
author_facet Wang, Yuxiang
Bai, Xuecheng
Hu, Boyu
Xu, Chuanzhi
Chen, Haodong
Chung, Vera
Li, Tingxue
Chen, Xiaoming
contents Small object detection in UAV imagery is crucial for applications such as search-and-rescue, traffic monitoring, and environmental surveillance, but it is hampered by tiny object size, low signal-to-noise ratios, and limited feature extraction. Existing multi-scale fusion methods help, but add computational burden and blur fine details, making small object detection in cluttered scenes difficult. To overcome these challenges, we propose the Multi-scale Global-detail Feature Integration Strategy (MGDFIS), a unified fusion framework that tightly couples global context with local detail to boost detection performance while maintaining efficiency. MGDFIS comprises three synergistic modules: the FusionLock-TSS Attention Module, which marries token-statistics self-attention with DynamicTanh normalization to highlight spectral and spatial cues at minimal cost; the Global-detail Integration Module, which fuses multi-scale context via directional convolution and parallel attention while preserving subtle shape and texture variations; and the Dynamic Pixel Attention Module, which generates pixel-wise weighting maps to rebalance uneven foreground and background distributions and sharpen responses to true object regions. Extensive experiments on the VisDrone benchmark demonstrate that MGDFIS consistently outperforms state-of-the-art methods across diverse backbone architectures and detection frameworks, achieving superior precision and recall with low inference time. By striking an optimal balance between accuracy and resource usage, MGDFIS provides a practical solution for small-object detection on resource-constrained UAV platforms.
format Preprint
id arxiv_https___arxiv_org_abs_2506_12697
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection
Wang, Yuxiang
Bai, Xuecheng
Hu, Boyu
Xu, Chuanzhi
Chen, Haodong
Chung, Vera
Li, Tingxue
Chen, Xiaoming
Computer Vision and Pattern Recognition
Artificial Intelligence
Machine Learning
Small object detection in UAV imagery is crucial for applications such as search-and-rescue, traffic monitoring, and environmental surveillance, but it is hampered by tiny object size, low signal-to-noise ratios, and limited feature extraction. Existing multi-scale fusion methods help, but add computational burden and blur fine details, making small object detection in cluttered scenes difficult. To overcome these challenges, we propose the Multi-scale Global-detail Feature Integration Strategy (MGDFIS), a unified fusion framework that tightly couples global context with local detail to boost detection performance while maintaining efficiency. MGDFIS comprises three synergistic modules: the FusionLock-TSS Attention Module, which marries token-statistics self-attention with DynamicTanh normalization to highlight spectral and spatial cues at minimal cost; the Global-detail Integration Module, which fuses multi-scale context via directional convolution and parallel attention while preserving subtle shape and texture variations; and the Dynamic Pixel Attention Module, which generates pixel-wise weighting maps to rebalance uneven foreground and background distributions and sharpen responses to true object regions. Extensive experiments on the VisDrone benchmark demonstrate that MGDFIS consistently outperforms state-of-the-art methods across diverse backbone architectures and detection frameworks, achieving superior precision and recall with low inference time. By striking an optimal balance between accuracy and resource usage, MGDFIS provides a practical solution for small-object detection on resource-constrained UAV platforms.
title MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection
topic Computer Vision and Pattern Recognition
Artificial Intelligence
Machine Learning
url https://arxiv.org/abs/2506.12697