Saved in:
Bibliographic Details
Main Authors: Hong, Shilong, Zhou, Yanzhou, Xu, Weichao
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.01710
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909600591118336
author Hong, Shilong
Zhou, Yanzhou
Xu, Weichao
author_facet Hong, Shilong
Zhou, Yanzhou
Xu, Weichao
contents With the rapid development of modern transportation systems and the exponential growth of logistics volumes, intelligent X-ray-based security inspection systems play a crucial role in public safety. Although single-view X-ray baggage scanner is widely deployed, they struggles to accurately identify contraband in complex stacking scenarios due to strong viewpoint dependency and inadequate feature representation. To address this, we propose a Dual-View Attention-Guided Network for Efficient X-ray Security Inspection (DAGNet). This study builds on a shared-weight backbone network as the foundation and constructs three key modules that work together: (1) Frequency Domain Interaction Module (FDIM) dynamically enhances features by adjusting frequency components based on inter-view relationships; (2) Dual-View Hierarchical Enhancement Module (DVHEM) employs cross-attention to align features between views and capture hierarchical associations; (3) Convolutional Guided Fusion Module (CGFM) fuses features to suppress redundancy while retaining critical discriminative information. Collectively, these modules substantially improve the performance of dual-view X-ray security inspection. Experimental results demonstrate that DAGNet outperforms existing state-of-the-art approaches across multiple backbone architectures. The code is available at:https://github.com/ShilongHong/DAGNet.
format Preprint
id arxiv_https___arxiv_org_abs_2502_01710
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle DAGNet: A Dual-View Attention-Guided Network for Efficient X-ray Security Inspection
Hong, Shilong
Zhou, Yanzhou
Xu, Weichao
Computer Vision and Pattern Recognition
With the rapid development of modern transportation systems and the exponential growth of logistics volumes, intelligent X-ray-based security inspection systems play a crucial role in public safety. Although single-view X-ray baggage scanner is widely deployed, they struggles to accurately identify contraband in complex stacking scenarios due to strong viewpoint dependency and inadequate feature representation. To address this, we propose a Dual-View Attention-Guided Network for Efficient X-ray Security Inspection (DAGNet). This study builds on a shared-weight backbone network as the foundation and constructs three key modules that work together: (1) Frequency Domain Interaction Module (FDIM) dynamically enhances features by adjusting frequency components based on inter-view relationships; (2) Dual-View Hierarchical Enhancement Module (DVHEM) employs cross-attention to align features between views and capture hierarchical associations; (3) Convolutional Guided Fusion Module (CGFM) fuses features to suppress redundancy while retaining critical discriminative information. Collectively, these modules substantially improve the performance of dual-view X-ray security inspection. Experimental results demonstrate that DAGNet outperforms existing state-of-the-art approaches across multiple backbone architectures. The code is available at:https://github.com/ShilongHong/DAGNet.
title DAGNet: A Dual-View Attention-Guided Network for Efficient X-ray Security Inspection
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2502.01710