Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Yuxiang, Bai, Xuecheng, Hu, Boyu, Xu, Chuanzhi, Chen, Haodong, Chung, Vera, Li, Tingxue, Chen, Xiaoming
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2506.12697
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909735948648448
author	Wang, Yuxiang Bai, Xuecheng Hu, Boyu Xu, Chuanzhi Chen, Haodong Chung, Vera Li, Tingxue Chen, Xiaoming
author_facet	Wang, Yuxiang Bai, Xuecheng Hu, Boyu Xu, Chuanzhi Chen, Haodong Chung, Vera Li, Tingxue Chen, Xiaoming
contents	Small object detection in UAV imagery is crucial for applications such as search-and-rescue, traffic monitoring, and environmental surveillance, but it is hampered by tiny object size, low signal-to-noise ratios, and limited feature extraction. Existing multi-scale fusion methods help, but add computational burden and blur fine details, making small object detection in cluttered scenes difficult. To overcome these challenges, we propose the Multi-scale Global-detail Feature Integration Strategy (MGDFIS), a unified fusion framework that tightly couples global context with local detail to boost detection performance while maintaining efficiency. MGDFIS comprises three synergistic modules: the FusionLock-TSS Attention Module, which marries token-statistics self-attention with DynamicTanh normalization to highlight spectral and spatial cues at minimal cost; the Global-detail Integration Module, which fuses multi-scale context via directional convolution and parallel attention while preserving subtle shape and texture variations; and the Dynamic Pixel Attention Module, which generates pixel-wise weighting maps to rebalance uneven foreground and background distributions and sharpen responses to true object regions. Extensive experiments on the VisDrone benchmark demonstrate that MGDFIS consistently outperforms state-of-the-art methods across diverse backbone architectures and detection frameworks, achieving superior precision and recall with low inference time. By striking an optimal balance between accuracy and resource usage, MGDFIS provides a practical solution for small-object detection on resource-constrained UAV platforms.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_12697
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection Wang, Yuxiang Bai, Xuecheng Hu, Boyu Xu, Chuanzhi Chen, Haodong Chung, Vera Li, Tingxue Chen, Xiaoming Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning Small object detection in UAV imagery is crucial for applications such as search-and-rescue, traffic monitoring, and environmental surveillance, but it is hampered by tiny object size, low signal-to-noise ratios, and limited feature extraction. Existing multi-scale fusion methods help, but add computational burden and blur fine details, making small object detection in cluttered scenes difficult. To overcome these challenges, we propose the Multi-scale Global-detail Feature Integration Strategy (MGDFIS), a unified fusion framework that tightly couples global context with local detail to boost detection performance while maintaining efficiency. MGDFIS comprises three synergistic modules: the FusionLock-TSS Attention Module, which marries token-statistics self-attention with DynamicTanh normalization to highlight spectral and spatial cues at minimal cost; the Global-detail Integration Module, which fuses multi-scale context via directional convolution and parallel attention while preserving subtle shape and texture variations; and the Dynamic Pixel Attention Module, which generates pixel-wise weighting maps to rebalance uneven foreground and background distributions and sharpen responses to true object regions. Extensive experiments on the VisDrone benchmark demonstrate that MGDFIS consistently outperforms state-of-the-art methods across diverse backbone architectures and detection frameworks, achieving superior precision and recall with low inference time. By striking an optimal balance between accuracy and resource usage, MGDFIS provides a practical solution for small-object detection on resource-constrained UAV platforms.
title	MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection
topic	Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning
url	https://arxiv.org/abs/2506.12697

Similar Items