Saved in:
| Main Authors: | Yang, Kaixuan, Xiang, Wei, Chen, Zhenshuai, Jin, Tong, Liu, Yunpeng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.13067 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FusionCounting: Robust visible-infrared image fusion guided by crowd counting via multi-task learning
by: Li, He, et al.
Published: (2025)
by: Li, He, et al.
Published: (2025)
MultiTaskVIF: Segmentation-oriented visible and infrared image fusion via multi-task learning
by: Zhao, Zixian, et al.
Published: (2025)
by: Zhao, Zixian, et al.
Published: (2025)
CAWM-Mamba: A unified model for infrared-visible image fusion and compound adverse weather restoration
by: Liu, Huichun, et al.
Published: (2026)
by: Liu, Huichun, et al.
Published: (2026)
HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation
by: Jiang, Chengjie, et al.
Published: (2024)
by: Jiang, Chengjie, et al.
Published: (2024)
Gradient-based multi-focus image fusion with focus-aware saliency enhancement
by: Li, Haoyu, et al.
Published: (2025)
by: Li, Haoyu, et al.
Published: (2025)
Fusion or Confusion? Assessing the impact of visible-thermal image fusion for automated wildlife detection
by: Dionne-Pierre, Camille, et al.
Published: (2025)
by: Dionne-Pierre, Camille, et al.
Published: (2025)
GAN-HA: A generative adversarial network with a novel heterogeneous dual-discriminator network and a new attention-based fusion strategy for infrared and visible image fusion
by: Lu, Guosheng, et al.
Published: (2024)
by: Lu, Guosheng, et al.
Published: (2024)
TFCT-I2P: Three stream fusion network with color aware transformer for image-to-point cloud registration
by: Peng, Muyao, et al.
Published: (2024)
by: Peng, Muyao, et al.
Published: (2024)
Msmsfnet: a multi-stream and multi-scale fusion net for edge detection
by: Liu, Chenguang, et al.
Published: (2024)
by: Liu, Chenguang, et al.
Published: (2024)
Multi-scale direction-aware SAR object detection network via global information fusion
by: Cao, Mingxiang, et al.
Published: (2023)
by: Cao, Mingxiang, et al.
Published: (2023)
OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation
by: Liu, Long, et al.
Published: (2025)
by: Liu, Long, et al.
Published: (2025)
FS-Diff: Semantic guidance and clarity-aware simultaneous multimodal image fusion and super-resolution
by: Jie, Yuchan, et al.
Published: (2025)
by: Jie, Yuchan, et al.
Published: (2025)
PatchDenoiser: Parameter-efficient multi-scale patch learning and fusion denoiser for Low-dose CT imaging
by: Fartiyal, Jitindra, et al.
Published: (2026)
by: Fartiyal, Jitindra, et al.
Published: (2026)
EDTformer: An Efficient Decoder Transformer for Visual Place Recognition
by: Jin, Tong, et al.
Published: (2024)
by: Jin, Tong, et al.
Published: (2024)
Visible and infrared self-supervised fusion trained on a single example
by: Ofir, Nati, et al.
Published: (2023)
by: Ofir, Nati, et al.
Published: (2023)
TriDE: Triangle-Consistent Translation Directions for Global Camera Pose Estimation
by: Chen, Francisco, et al.
Published: (2026)
by: Chen, Francisco, et al.
Published: (2026)
FBSDiff++: Improved Frequency Band Substitution of Diffusion Features for Efficient and Highly Controllable Text-Driven Image-to-Image Translation
by: Gao, Xiang, et al.
Published: (2026)
by: Gao, Xiang, et al.
Published: (2026)
Boundary feature fusion network for tooth image segmentation
by: Zhang, Dongping, et al.
Published: (2024)
by: Zhang, Dongping, et al.
Published: (2024)
Inhomogeneous illumination image enhancement under ex-tremely low visibility condition
by: Chen, Libang, et al.
Published: (2024)
by: Chen, Libang, et al.
Published: (2024)
Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era
by: Lu, Feng, et al.
Published: (2025)
by: Lu, Feng, et al.
Published: (2025)
GAQAT: gradient-adaptive quantization-aware training for domain generalization
by: Jiang, Jiacheng, et al.
Published: (2024)
by: Jiang, Jiacheng, et al.
Published: (2024)
Serial fusion of multi-modal biometric systems
by: Marcialis, Gian Luca, et al.
Published: (2024)
by: Marcialis, Gian Luca, et al.
Published: (2024)
Category-aware EEG image generation based on wavelet transform and contrast semantic loss
by: Zhang, Enshang, et al.
Published: (2025)
by: Zhang, Enshang, et al.
Published: (2025)
Ranking-aware adapter for text-driven image ordering with CLIP
by: Yu, Wei-Hsiang, et al.
Published: (2024)
by: Yu, Wei-Hsiang, et al.
Published: (2024)
FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model
by: Xu, Yushen, et al.
Published: (2025)
by: Xu, Yushen, et al.
Published: (2025)
multimodars: A Rust-powered toolkit for multi-modality cardiac image fusion and registration
by: Stark, Anselm W., et al.
Published: (2025)
by: Stark, Anselm W., et al.
Published: (2025)
ZoomLDM: Latent Diffusion Model for multi-scale image generation
by: Yellapragada, Srikar, et al.
Published: (2024)
by: Yellapragada, Srikar, et al.
Published: (2024)
Contextual fusion enhances robustness to image blurring
by: Joshi, Shruti, et al.
Published: (2024)
by: Joshi, Shruti, et al.
Published: (2024)
A multi-weight self-matching visual explanation for cnns on sar images
by: Sun, Siyuan, et al.
Published: (2025)
by: Sun, Siyuan, et al.
Published: (2025)
SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition
by: Lu, Feng, et al.
Published: (2025)
by: Lu, Feng, et al.
Published: (2025)
Cross-modal ultra-scale learning with tri-modalities of renal biopsy images for glomerular multi-disease auxiliary diagnosis
by: Long, Kaixing, et al.
Published: (2025)
by: Long, Kaixing, et al.
Published: (2025)
DART: Differentiable Dynamic Adaptive Region Tokenizer for Vision Foundation Models
by: Yin, Shicheng, et al.
Published: (2025)
by: Yin, Shicheng, et al.
Published: (2025)
VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis
by: Yin, Shicheng, et al.
Published: (2024)
by: Yin, Shicheng, et al.
Published: (2024)
Depth-aware Volume Attention for Texture-less Stereo Matching
by: Zhao, Tong, et al.
Published: (2024)
by: Zhao, Tong, et al.
Published: (2024)
Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization
by: Liu, Zhuohan, et al.
Published: (2026)
by: Liu, Zhuohan, et al.
Published: (2026)
Direction-aware 3D Large Multimodal Models
by: Liu, Quan, et al.
Published: (2026)
by: Liu, Quan, et al.
Published: (2026)
Multi-Scale Direction-Aware Network for Infrared Small Target Detection
by: Zhao, Jinmiao, et al.
Published: (2024)
by: Zhao, Jinmiao, et al.
Published: (2024)
SEAL: Semantic-aware Single-image Sticker Personalization with a Large-scale Sticker-tag Dataset
by: Roh, Changhyun, et al.
Published: (2026)
by: Roh, Changhyun, et al.
Published: (2026)
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
by: Zhang, Yifei, et al.
Published: (2025)
by: Zhang, Yifei, et al.
Published: (2025)
WaDi: Weight Direction-aware Distillation for One-step Image Synthesis
by: Wang, Lei, et al.
Published: (2026)
by: Wang, Lei, et al.
Published: (2026)
Similar Items
-
FusionCounting: Robust visible-infrared image fusion guided by crowd counting via multi-task learning
by: Li, He, et al.
Published: (2025) -
MultiTaskVIF: Segmentation-oriented visible and infrared image fusion via multi-task learning
by: Zhao, Zixian, et al.
Published: (2025) -
CAWM-Mamba: A unified model for infrared-visible image fusion and compound adverse weather restoration
by: Liu, Huichun, et al.
Published: (2026) -
HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation
by: Jiang, Chengjie, et al.
Published: (2024) -
Gradient-based multi-focus image fusion with focus-aware saliency enhancement
by: Li, Haoyu, et al.
Published: (2025)