Saved in:
| Main Authors: | Han, Jianhong, Wang, Yupei, Zhang, Yuan, Chen, Liang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.00363 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Style-Adaptive Detection Transformer for Single-Source Domain Generalized Object Detection
by: Han, Jianhong, et al.
Published: (2025)
by: Han, Jianhong, et al.
Published: (2025)
VFM-Guided Semi-Supervised Detection Transformer under Source-Free Constraints for Remote Sensing Object Detection
by: Han, Jianhong, et al.
Published: (2025)
by: Han, Jianhong, et al.
Published: (2025)
DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment
by: Han, Jianhong, et al.
Published: (2024)
by: Han, Jianhong, et al.
Published: (2024)
MS-DETR: Multispectral Pedestrian Detection Transformer with Loosely Coupled Fusion and Modality-Balanced Optimization
by: Xing, Yinghui, et al.
Published: (2023)
by: Xing, Yinghui, et al.
Published: (2023)
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation
by: Hu, Xiaoxing, et al.
Published: (2025)
by: Hu, Xiaoxing, et al.
Published: (2025)
MambaSOD: Dual Mamba-Driven Cross-Modal Fusion Network for RGB-D Salient Object Detection
by: Zhan, Yue, et al.
Published: (2024)
by: Zhan, Yue, et al.
Published: (2024)
Spatial-Frequency Enhanced Mamba for Multi-Modal Image Fusion
by: Sun, Hui, et al.
Published: (2025)
by: Sun, Hui, et al.
Published: (2025)
Interactive Spatial-Frequency Fusion Mamba for Multi-Modal Image Fusion
by: Zhu, Yixin, et al.
Published: (2026)
by: Zhu, Yixin, et al.
Published: (2026)
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
by: Yu, Ning, et al.
Published: (2022)
by: Yu, Ning, et al.
Published: (2022)
DG-DETR: Toward Domain Generalized Detection Transformer
by: Hwang, Seongmin, et al.
Published: (2025)
by: Hwang, Seongmin, et al.
Published: (2025)
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
by: Jia, Ding, et al.
Published: (2024)
by: Jia, Ding, et al.
Published: (2024)
MMR-Mamba: Multi-Modal MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion
by: Zou, Jing, et al.
Published: (2024)
by: Zou, Jing, et al.
Published: (2024)
FMC-DETR: Frequency-Decoupled Multi-Domain Coordination for Aerial-View Object Detection
by: Liang, Ben, et al.
Published: (2025)
by: Liang, Ben, et al.
Published: (2025)
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection
by: Chen, Qiang, et al.
Published: (2024)
by: Chen, Qiang, et al.
Published: (2024)
Mr. DETR++: Instructive Multi-Route Training for Detection Transformers with Mixture-of-Experts
by: Zhang, Chang-Bin, et al.
Published: (2024)
by: Zhang, Chang-Bin, et al.
Published: (2024)
FMRFT: Fusion Mamba and DETR for Query Time Sequence Intersection Fish Tracking
by: Yao, Mingyuan, et al.
Published: (2024)
by: Yao, Mingyuan, et al.
Published: (2024)
Center-Aware Detection with Swin-based Co-DETR Framework for Cervical Cytology
by: Kong, Yan, et al.
Published: (2026)
by: Kong, Yan, et al.
Published: (2026)
CGF-DETR: Cross-Gated Fusion DETR for Enhanced Pneumonia Detection in Chest X-rays
by: Wu, Yefeng, et al.
Published: (2025)
by: Wu, Yefeng, et al.
Published: (2025)
Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection
by: Hu, Xuzhong, et al.
Published: (2025)
by: Hu, Xuzhong, et al.
Published: (2025)
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
by: Huang, Yi-Xin, et al.
Published: (2024)
by: Huang, Yi-Xin, et al.
Published: (2024)
Dual-R-DETR: Resolving Query Competition with Pairwise Routing in Transformer Decoders
by: Zhang, Ye, et al.
Published: (2025)
by: Zhang, Ye, et al.
Published: (2025)
RT-DETR++ for UAV Object Detection
by: Shufang, Yuan
Published: (2025)
by: Shufang, Yuan
Published: (2025)
OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer
by: Wang, Yu, et al.
Published: (2024)
by: Wang, Yu, et al.
Published: (2024)
UAVD-Mamba: Deformable Token Fusion Vision Mamba for Multimodal UAV Detection
by: Li, Wei, et al.
Published: (2025)
by: Li, Wei, et al.
Published: (2025)
CAF-Mamba: Mamba-Based Cross-Modal Adaptive Attention Fusion for Multimodal Depression Detection
by: Zhou, Bowen, et al.
Published: (2026)
by: Zhou, Bowen, et al.
Published: (2026)
WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer
by: Yin, Huilin, et al.
Published: (2025)
by: Yin, Huilin, et al.
Published: (2025)
MS-DETR: Efficient DETR Training with Mixed Supervision
by: Zhao, Chuyang, et al.
Published: (2024)
by: Zhao, Chuyang, et al.
Published: (2024)
WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection
by: Zhu, Haodong, et al.
Published: (2025)
by: Zhu, Haodong, et al.
Published: (2025)
Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
by: Hu, Zhangchi, et al.
Published: (2025)
by: Hu, Zhangchi, et al.
Published: (2025)
D$^3$R-DETR: DETR with Dual-Domain Density Refinement for Tiny Object Detection in Aerial Images
by: Wen, Zixiao, et al.
Published: (2026)
by: Wen, Zixiao, et al.
Published: (2026)
Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
by: Hou, Xiuquan, et al.
Published: (2024)
by: Hou, Xiuquan, et al.
Published: (2024)
FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba
by: Xie, Xinyu, et al.
Published: (2024)
by: Xie, Xinyu, et al.
Published: (2024)
DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection
by: Ye, Jiaxin, et al.
Published: (2024)
by: Ye, Jiaxin, et al.
Published: (2024)
Why mamba is effective? Exploit Linear Transformer-Mamba Network for Multi-Modality Image Fusion
by: Zhu, Chenguang, et al.
Published: (2024)
by: Zhu, Chenguang, et al.
Published: (2024)
Laplace-Mamba: Laplace Frequency Prior-Guided Mamba-CNN Fusion Network for Image Dehazing
by: Wang, Yongzhen, et al.
Published: (2025)
by: Wang, Yongzhen, et al.
Published: (2025)
Le-DETR: Revisiting Real-Time Detection Transformer with Efficient Encoder Design
by: Huang, Jiannan, et al.
Published: (2026)
by: Huang, Jiannan, et al.
Published: (2026)
KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling
by: Wang, Yu, et al.
Published: (2022)
by: Wang, Yu, et al.
Published: (2022)
MGHFT: Multi-Granularity Hierarchical Fusion Transformer for Cross-Modal Sticker Emotion Recognition
by: Chen, Jian, et al.
Published: (2025)
by: Chen, Jian, et al.
Published: (2025)
CROME: Cross-Modal Adapters for Efficient Multimodal LLM
by: Ebrahimi, Sayna, et al.
Published: (2024)
by: Ebrahimi, Sayna, et al.
Published: (2024)
DFIR-DETR: Frequency-Domain Iterative Refinement and Dynamic Feature Aggregation for Small Object Detection
by: Gao, Bo, et al.
Published: (2025)
by: Gao, Bo, et al.
Published: (2025)
Similar Items
-
Style-Adaptive Detection Transformer for Single-Source Domain Generalized Object Detection
by: Han, Jianhong, et al.
Published: (2025) -
VFM-Guided Semi-Supervised Detection Transformer under Source-Free Constraints for Remote Sensing Object Detection
by: Han, Jianhong, et al.
Published: (2025) -
DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment
by: Han, Jianhong, et al.
Published: (2024) -
MS-DETR: Multispectral Pedestrian Detection Transformer with Loosely Coupled Fusion and Modality-Balanced Optimization
by: Xing, Yinghui, et al.
Published: (2023) -
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation
by: Hu, Xiaoxing, et al.
Published: (2025)