:: Library Catalog

Bibliographic Details
Main Authors:	Wang, Leyang; Lin, Joice
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2503.16376

Similar Items

PIG: Prompt Images Guidance for Night-Time Scene Parsing
by: Xie, Zhifeng, et al.
Published: (2024)

Lightweight Facial Landmark Detection in Thermal Images via Multi-Level Cross-Modal Knowledge Transfer
by: Tong, Qiyi, et al.
Published: (2025)

Neuromorphic Facial Analysis with Cross-Modal Supervision
by: Becattini, Federico, et al.
Published: (2024)

PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models
by: Wan, Jiansong, et al.
Published: (2025)

Personalized Cross-Modal Emotional Correlation Learning for Speech-Preserving Facial Expression Manipulation
by: Chen, Tianshui, et al.
Published: (2026)

From Cross-Modal to Mixed-Modal Visible-Infrared Re-Identification
by: Alehdaghi, Mahdi, et al.
Published: (2025)

Visible-Infrared Person Re-Identification via Patch-Mixed Cross-Modality Learning
by: Qian, Zhihao, et al.
Published: (2023)

VT-Intrinsic: Physics-Based Decomposition of Reflectance and Shading using a Single Visible-Thermal Image Pair
by: Yuan, Zeqing, et al.
Published: (2025)

BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
by: Purushwalkam, Senthil, et al.
Published: (2024)

T-FAKE: Synthesizing Thermal Images for Facial Landmarking
by: Flotho, Philipp, et al.
Published: (2024)

Beyond Strict Pairing: Arbitrarily Paired Training for High-Performance Infrared and Visible Image Fusion
by: Deng, Yanglin, et al.
Published: (2026)

CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images
by: Hu, Bin, et al.
Published: (2025)

CM-Bench: A Comprehensive Cross-Modal Feature Matching Benchmark Bridging Visible and Infrared Images
by: Sun, Liangzheng, et al.
Published: (2026)

Cross-Modal Causal Intervention for Medical Report Generation
by: Chen, Weixing, et al.
Published: (2023)

VisIRNet: Deep Image Alignment for UAV-taken Visible and Infrared Image Pairs
by: Ozer, Sedat, et al.
Published: (2024)

AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
by: Lin, Yiheng, et al.
Published: (2025)

PIG: Physically-based Multi-Material Interaction with 3D Gaussians
by: Xiao, Zeyu, et al.
Published: (2025)

Language-Depth Navigated Thermal and Visible Image Fusion
by: Zhang, Jinchang, et al.
Published: (2025)

CFCPalsy: Facial Image Synthesis with Cross-Fusion Cycle Diffusion Model for Facial Paralysis Individuals
by: Gao, Weixiang, et al.
Published: (2024)

Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation
by: Hu, Guanyu, et al.
Published: (2024)

Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval
by: Han, Haochen, et al.
Published: (2024)

Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification
by: Liang, Tengfei, et al.
Published: (2023)

SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration
by: Tong, Xi, et al.
Published: (2025)

Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images
by: Pei, Shufan, et al.
Published: (2024)

Continual Cross-Modal Generalization
by: Xia, Yan, et al.
Published: (2025)

FCDFusion: a Fast, Low Color Deviation Method for Fusing Visible and Infrared Image Pairs
by: Li, Hesong, et al.
Published: (2024)

Contrast-Guided Cross-Modal Distillation for Thermal Object Detection
by: Kim, SiWoo, et al.
Published: (2025)

Unsupervised Visible-Infrared ReID via Pseudo-label Correction and Modality-level Alignment
by: Liu, Yexin, et al.
Published: (2024)

Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws
by: Guo, Lin, et al.
Published: (2025)

UNIV: Unified Foundation Model for Infrared and Visible Modalities
by: Mao, Fangyuan, et al.
Published: (2025)

Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID
by: Cheng, De, et al.
Published: (2023)

Thermal-Det: Language-Guided Cross-Modal Distillation for Open-Vocabulary Thermal Object Detection
by: Ranasinghe, Yasiru, et al.
Published: (2026)

Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
by: Huang, Hailang, et al.
Published: (2024)

Image Translation-Based Unsupervised Cross-Modality Domain Adaptation for Medical Image Segmentation
by: Yang, Tao, et al.
Published: (2025)

Causality-Driven Infrared and Visible Image Fusion
by: Ma, Linli, et al.
Published: (2025)

Cross Modality Image Translation In Medical Imaging Using Generative Frameworks
by: Romoli, Giulia, et al.
Published: (2026)

DiffX: Guide Your Layout to Cross-Modal Generative Modeling
by: Wang, Zeyu, et al.
Published: (2024)

Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification
by: Yang, Xi, et al.
Published: (2024)

Adaptive Domain Shift in Diffusion Models for Cross-Modality Image Translation
by: Wang, Zihao, et al.
Published: (2026)

PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks
by: Li, Junxian, et al.
Published: (2026)