| Main Authors: | Wang, Leyang, Lin, Joice |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.16376 |
Similar Items
PIG: Prompt Images Guidance for Night-Time Scene Parsing
by: Xie, Zhifeng, et al.
Published: (2024)
Lightweight Facial Landmark Detection in Thermal Images via Multi-Level Cross-Modal Knowledge Transfer
by: Tong, Qiyi, et al.
Published: (2025)
Neuromorphic Facial Analysis with Cross-Modal Supervision
by: Becattini, Federico, et al.
Published: (2024)
PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models
by: Wan, Jiansong, et al.
Published: (2025)
Personalized Cross-Modal Emotional Correlation Learning for Speech-Preserving Facial Expression Manipulation
by: Chen, Tianshui, et al.
Published: (2026)
From Cross-Modal to Mixed-Modal Visible-Infrared Re-Identification
by: Alehdaghi, Mahdi, et al.
Published: (2025)
Visible-Infrared Person Re-Identification via Patch-Mixed Cross-Modality Learning
by: Qian, Zhihao, et al.
Published: (2023)
VT-Intrinsic: Physics-Based Decomposition of Reflectance and Shading using a Single Visible-Thermal Image Pair
by: Yuan, Zeqing, et al.
Published: (2025)
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
by: Purushwalkam, Senthil, et al.
Published: (2024)
T-FAKE: Synthesizing Thermal Images for Facial Landmarking
by: Flotho, Philipp, et al.
Published: (2024)
Beyond Strict Pairing: Arbitrarily Paired Training for High-Performance Infrared and Visible Image Fusion
by: Deng, Yanglin, et al.
Published: (2026)
CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images
by: Hu, Bin, et al.
Published: (2025)
CM-Bench: A Comprehensive Cross-Modal Feature Matching Benchmark Bridging Visible and Infrared Images
by: Sun, Liangzheng, et al.
Published: (2026)
Cross-Modal Causal Intervention for Medical Report Generation
by: Chen, Weixing, et al.
Published: (2023)
VisIRNet: Deep Image Alignment for UAV-taken Visible and Infrared Image Pairs
by: Ozer, Sedat, et al.
Published: (2024)
AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
by: Lin, Yiheng, et al.
Published: (2025)
PIG: Physically-based Multi-Material Interaction with 3D Gaussians
by: Xiao, Zeyu, et al.
Published: (2025)
Language-Depth Navigated Thermal and Visible Image Fusion
by: Zhang, Jinchang, et al.
Published: (2025)
CFCPalsy: Facial Image Synthesis with Cross-Fusion Cycle Diffusion Model for Facial Paralysis Individuals
by: Gao, Weixiang, et al.
Published: (2024)
Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation
by: Hu, Guanyu, et al.
Published: (2024)
Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval
by: Han, Haochen, et al.
Published: (2024)
Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification
by: Liang, Tengfei, et al.
Published: (2023)
SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration
by: Tong, Xi, et al.
Published: (2025)
Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images
by: Pei, Shufan, et al.
Published: (2024)
Continual Cross-Modal Generalization
by: Xia, Yan, et al.
Published: (2025)
FCDFusion: a Fast, Low Color Deviation Method for Fusing Visible and Infrared Image Pairs
by: Li, Hesong, et al.
Published: (2024)
Contrast-Guided Cross-Modal Distillation for Thermal Object Detection
by: Kim, SiWoo, et al.
Published: (2025)
Unsupervised Visible-Infrared ReID via Pseudo-label Correction and Modality-level Alignment
by: Liu, Yexin, et al.
Published: (2024)
Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws
by: Guo, Lin, et al.
Published: (2025)
UNIV: Unified Foundation Model for Infrared and Visible Modalities
by: Mao, Fangyuan, et al.
Published: (2025)
Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID
by: Cheng, De, et al.
Published: (2023)
Thermal-Det: Language-Guided Cross-Modal Distillation for Open-Vocabulary Thermal Object Detection
by: Ranasinghe, Yasiru, et al.
Published: (2026)
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
by: Huang, Hailang, et al.
Published: (2024)
Image Translation-Based Unsupervised Cross-Modality Domain Adaptation for Medical Image Segmentation
by: Yang, Tao, et al.
Published: (2025)
Causality-Driven Infrared and Visible Image Fusion
by: Ma, Linli, et al.
Published: (2025)
Cross Modality Image Translation In Medical Imaging Using Generative Frameworks
by: Romoli, Giulia, et al.
Published: (2026)
DiffX: Guide Your Layout to Cross-Modal Generative Modeling
by: Wang, Zeyu, et al.
Published: (2024)
Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification
by: Yang, Xi, et al.
Published: (2024)
Adaptive Domain Shift in Diffusion Models for Cross-Modality Image Translation
by: Wang, Zihao, et al.
Published: (2026)
PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks
by: Li, Junxian, et al.
Published: (2026)