Saved in:
| Main Authors: | Zhang, Yue, Zhuo, Zhizheng, Xu, Siyao, Lv, Shan, Liu, Zhaoxi, Qiu, Jun, Wang, Qiuli, Liu, Yaou, Zhou, S. Kevin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.19723 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Unified Multi-Modal Image Synthesis for Missing Modality Imputation
by: Zhang, Yue, et al.
Published: (2023)
by: Zhang, Yue, et al.
Published: (2023)
Collaborative Multi-Modal Coding for High-Quality 3D Generation
by: Cao, Ziang, et al.
Published: (2025)
by: Cao, Ziang, et al.
Published: (2025)
Towards Open Domain Text-Driven Synthesis of Multi-Person Motions
by: Shan, Mengyi, et al.
Published: (2024)
by: Shan, Mengyi, et al.
Published: (2024)
OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging
by: Liu, Meilin, et al.
Published: (2026)
by: Liu, Meilin, et al.
Published: (2026)
Triple-Phase Sequential Fusion Network for Hepatobiliary Phase Liver MRI Synthesis
by: Wang, Qiuli, et al.
Published: (2026)
by: Wang, Qiuli, et al.
Published: (2026)
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
by: Lv, Zhengyao, et al.
Published: (2025)
by: Lv, Zhengyao, et al.
Published: (2025)
Enhancing Cross-Modality Synthesis: Subvolume Merging for MRI-to-CT Conversion
by: Fan, Fuxin, et al.
Published: (2024)
by: Fan, Fuxin, et al.
Published: (2024)
Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction
by: Mu, Zhaoxi, et al.
Published: (2024)
by: Mu, Zhaoxi, et al.
Published: (2024)
AG-TAL: Anatomically-Guided Topology-Aware Loss for Multiclass Segmentation of the Circle of Willis Using Large-Scale Multi-Center Datasets
by: Liu, Jialu, et al.
Published: (2026)
by: Liu, Jialu, et al.
Published: (2026)
Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis
by: Ren, Jingjing, et al.
Published: (2023)
by: Ren, Jingjing, et al.
Published: (2023)
Towards Model-Agnostic Dataset Condensation by Heterogeneous Models
by: Moon, Jun-Yeong, et al.
Published: (2024)
by: Moon, Jun-Yeong, et al.
Published: (2024)
Mamba-Based Modality Disentanglement Network for Multi-Contrast MRI Reconstruction
by: Lyu, Weiyi, et al.
Published: (2025)
by: Lyu, Weiyi, et al.
Published: (2025)
IntraStyler: Intra-Domain Style Synthesis for Cross-Modality MRI Domain Adaptation
by: Liu, Han, et al.
Published: (2026)
by: Liu, Han, et al.
Published: (2026)
Better with Less: Tackling Heterogeneous Multi-Modal Image Joint Pretraining via Conditioned and Degraded Masked Autoencoder
by: Peng, Bowen, et al.
Published: (2026)
by: Peng, Bowen, et al.
Published: (2026)
MedAtlas: Evaluating LLMs for Multi-Round, Multi-Task Medical Reasoning Across Diverse Imaging Modalities and Clinical Text
by: Xu, Ronghao, et al.
Published: (2025)
by: Xu, Ronghao, et al.
Published: (2025)
Modality Alignment across Trees on Heterogeneous Hyperbolic Manifolds
by: Wu, Wei, et al.
Published: (2025)
by: Wu, Wei, et al.
Published: (2025)
Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering
by: Yao, Jiawei, et al.
Published: (2024)
by: Yao, Jiawei, et al.
Published: (2024)
Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding
by: Liu, Yi, et al.
Published: (2024)
by: Liu, Yi, et al.
Published: (2024)
AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation
by: Qiu, Lu, et al.
Published: (2025)
by: Qiu, Lu, et al.
Published: (2025)
Towards Modality Generalization: A Benchmark and Prospective Analysis
by: Liu, Xiaohao, et al.
Published: (2024)
by: Liu, Xiaohao, et al.
Published: (2024)
Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels
by: Liu, Zhizheng, et al.
Published: (2024)
by: Liu, Zhizheng, et al.
Published: (2024)
Joint Optimization for 4D Human-Scene Reconstruction in the Wild
by: Liu, Zhizheng, et al.
Published: (2025)
by: Liu, Zhizheng, et al.
Published: (2025)
Bidirectional Learning of Facial Action Units and Expressions via Structured Semantic Mapping across Heterogeneous Datasets
by: Li, Jia, et al.
Published: (2026)
by: Li, Jia, et al.
Published: (2026)
WFM: 3D Wavelet Flow Matching for Ultrafast Multi-Modal MRI Synthesis
by: Tur, Yalcin, et al.
Published: (2026)
by: Tur, Yalcin, et al.
Published: (2026)
FedMM: Federated Multi-Modal Learning with Modality Heterogeneity in Computational Pathology
by: Peng, Yuanzhe, et al.
Published: (2024)
by: Peng, Yuanzhe, et al.
Published: (2024)
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
by: Ma, Xinyu, et al.
Published: (2024)
by: Ma, Xinyu, et al.
Published: (2024)
M3D-Net: Multi-Modal 3D Facial Feature Reconstruction Network for Deepfake Detection
by: Wu, Haotian, et al.
Published: (2026)
by: Wu, Haotian, et al.
Published: (2026)
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
by: Zhou, Yang, et al.
Published: (2025)
by: Zhou, Yang, et al.
Published: (2025)
IDSelect: A RL-Based Cost-Aware Selection Agent for Video-based Multi-Modal Person Recognition
by: Ji, Yuyang, et al.
Published: (2026)
by: Ji, Yuyang, et al.
Published: (2026)
MDPE: A Multimodal Deception Dataset with Personality and Emotional Characteristics
by: Cai, Cong, et al.
Published: (2024)
by: Cai, Cong, et al.
Published: (2024)
RMMSS: Towards Advanced Robust Multi-Modal Semantic Segmentation with Hybrid Prototype Distillation and Feature Selection
by: Tan, Jiaqi, et al.
Published: (2025)
by: Tan, Jiaqi, et al.
Published: (2025)
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling
by: Yang, Jian, et al.
Published: (2024)
by: Yang, Jian, et al.
Published: (2024)
Multi-party Collaborative Attention Control for Image Customization
by: Yang, Han, et al.
Published: (2025)
by: Yang, Han, et al.
Published: (2025)
UAVScenes: A Multi-Modal Dataset for UAVs
by: Wang, Sijie, et al.
Published: (2025)
by: Wang, Sijie, et al.
Published: (2025)
MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities
by: Lincetto, Federico, et al.
Published: (2025)
by: Lincetto, Federico, et al.
Published: (2025)
Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm
by: Zhu, Xue-Feng, et al.
Published: (2025)
by: Zhu, Xue-Feng, et al.
Published: (2025)
OnlineSI: Taming Large Language Model for Online 3D Understanding and Grounding
by: Liu, Zixian, et al.
Published: (2026)
by: Liu, Zixian, et al.
Published: (2026)
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models
by: Qiu, Haonan, et al.
Published: (2024)
by: Qiu, Haonan, et al.
Published: (2024)
Linking Modality Isolation in Heterogeneous Collaborative Perception
by: Liu, Changxing, et al.
Published: (2026)
by: Liu, Changxing, et al.
Published: (2026)
Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques
by: Liu, Weide, et al.
Published: (2025)
by: Liu, Weide, et al.
Published: (2025)
Similar Items
-
Unified Multi-Modal Image Synthesis for Missing Modality Imputation
by: Zhang, Yue, et al.
Published: (2023) -
Collaborative Multi-Modal Coding for High-Quality 3D Generation
by: Cao, Ziang, et al.
Published: (2025) -
Towards Open Domain Text-Driven Synthesis of Multi-Person Motions
by: Shan, Mengyi, et al.
Published: (2024) -
OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging
by: Liu, Meilin, et al.
Published: (2026) -
Triple-Phase Sequential Fusion Network for Hepatobiliary Phase Liver MRI Synthesis
by: Wang, Qiuli, et al.
Published: (2026)