:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yang, Kaixuan, Xiang, Wei, Chen, Zhenshuai, Jin, Tong, Liu, Yunpeng
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2510.13067
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FusionCounting: Robust visible-infrared image fusion guided by crowd counting via multi-task learning
by: Li, He, et al.
Published: (2025)

MultiTaskVIF: Segmentation-oriented visible and infrared image fusion via multi-task learning
by: Zhao, Zixian, et al.
Published: (2025)

CAWM-Mamba: A unified model for infrared-visible image fusion and compound adverse weather restoration
by: Liu, Huichun, et al.
Published: (2026)

HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation
by: Jiang, Chengjie, et al.
Published: (2024)

Gradient-based multi-focus image fusion with focus-aware saliency enhancement
by: Li, Haoyu, et al.
Published: (2025)

Fusion or Confusion? Assessing the impact of visible-thermal image fusion for automated wildlife detection
by: Dionne-Pierre, Camille, et al.
Published: (2025)

GAN-HA: A generative adversarial network with a novel heterogeneous dual-discriminator network and a new attention-based fusion strategy for infrared and visible image fusion
by: Lu, Guosheng, et al.
Published: (2024)

TFCT-I2P: Three stream fusion network with color aware transformer for image-to-point cloud registration
by: Peng, Muyao, et al.
Published: (2024)

Msmsfnet: a multi-stream and multi-scale fusion net for edge detection
by: Liu, Chenguang, et al.
Published: (2024)

Multi-scale direction-aware SAR object detection network via global information fusion
by: Cao, Mingxiang, et al.
Published: (2023)

OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation
by: Liu, Long, et al.
Published: (2025)

FS-Diff: Semantic guidance and clarity-aware simultaneous multimodal image fusion and super-resolution
by: Jie, Yuchan, et al.
Published: (2025)

PatchDenoiser: Parameter-efficient multi-scale patch learning and fusion denoiser for Low-dose CT imaging
by: Fartiyal, Jitindra, et al.
Published: (2026)

EDTformer: An Efficient Decoder Transformer for Visual Place Recognition
by: Jin, Tong, et al.
Published: (2024)

Visible and infrared self-supervised fusion trained on a single example
by: Ofir, Nati, et al.
Published: (2023)

TriDE: Triangle-Consistent Translation Directions for Global Camera Pose Estimation
by: Chen, Francisco, et al.
Published: (2026)

FBSDiff++: Improved Frequency Band Substitution of Diffusion Features for Efficient and Highly Controllable Text-Driven Image-to-Image Translation
by: Gao, Xiang, et al.
Published: (2026)

Boundary feature fusion network for tooth image segmentation
by: Zhang, Dongping, et al.
Published: (2024)

Inhomogeneous illumination image enhancement under ex-tremely low visibility condition
by: Chen, Libang, et al.
Published: (2024)

Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era
by: Lu, Feng, et al.
Published: (2025)

GAQAT: gradient-adaptive quantization-aware training for domain generalization
by: Jiang, Jiacheng, et al.
Published: (2024)

Serial fusion of multi-modal biometric systems
by: Marcialis, Gian Luca, et al.
Published: (2024)

Category-aware EEG image generation based on wavelet transform and contrast semantic loss
by: Zhang, Enshang, et al.
Published: (2025)

Ranking-aware adapter for text-driven image ordering with CLIP
by: Yu, Wei-Hsiang, et al.
Published: (2024)

FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model
by: Xu, Yushen, et al.
Published: (2025)

multimodars: A Rust-powered toolkit for multi-modality cardiac image fusion and registration
by: Stark, Anselm W., et al.
Published: (2025)

ZoomLDM: Latent Diffusion Model for multi-scale image generation
by: Yellapragada, Srikar, et al.
Published: (2024)

Contextual fusion enhances robustness to image blurring
by: Joshi, Shruti, et al.
Published: (2024)

A multi-weight self-matching visual explanation for cnns on sar images
by: Sun, Siyuan, et al.
Published: (2025)

SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition
by: Lu, Feng, et al.
Published: (2025)

Cross-modal ultra-scale learning with tri-modalities of renal biopsy images for glomerular multi-disease auxiliary diagnosis
by: Long, Kaixing, et al.
Published: (2025)

DART: Differentiable Dynamic Adaptive Region Tokenizer for Vision Foundation Models
by: Yin, Shicheng, et al.
Published: (2025)

VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis
by: Yin, Shicheng, et al.
Published: (2024)

Depth-aware Volume Attention for Texture-less Stereo Matching
by: Zhao, Tong, et al.
Published: (2024)

Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization
by: Liu, Zhuohan, et al.
Published: (2026)

Direction-aware 3D Large Multimodal Models
by: Liu, Quan, et al.
Published: (2026)

Multi-Scale Direction-Aware Network for Infrared Small Target Detection
by: Zhao, Jinmiao, et al.
Published: (2024)

SEAL: Semantic-aware Single-image Sticker Personalization with a Large-scale Sticker-tag Dataset
by: Roh, Changhyun, et al.
Published: (2026)

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
by: Zhang, Yifei, et al.
Published: (2025)

WaDi: Weight Direction-aware Distillation for One-step Image Synthesis
by: Wang, Lei, et al.
Published: (2026)