Saved in:
| Main Authors: | Xia, Zixuan, Wang, Hao, Weng, Pengcheng, Qian, Yanyu, Xu, Yangxin, Dan, William, Wang, Fei |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.21670 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
COMPASS: Complete Multimodal Fusion via Proxy Tokens and Shared Spaces for Ubiquitous Sensing
by: Wang, Hao, et al.
Published: (2026)
by: Wang, Hao, et al.
Published: (2026)
Purify-then-Align: Towards Robust Human Sensing under Modality Missing with Knowledge Distillation from Noisy Multimodal Teacher
by: Weng, Pengcheng, et al.
Published: (2026)
by: Weng, Pengcheng, et al.
Published: (2026)
What's on Your Plate? Inferring Chinese Cuisine Intake from Wearable IMUs
by: Yin, Jiaxi, et al.
Published: (2025)
by: Yin, Jiaxi, et al.
Published: (2025)
Automatic Recognition of Abdominal Organs in Ultrasound Images based on Deep Neural Networks and K-Nearest-Neighbor Classification
by: Li, Keyu, et al.
Published: (2021)
by: Li, Keyu, et al.
Published: (2021)
Multimodal Structure Learning: Disentangling Shared and Specific Topology via Cross-Modal Graphical Lasso
by: Wang, Fei, et al.
Published: (2026)
by: Wang, Fei, et al.
Published: (2026)
Interpretable Alzheimer's Diagnosis via Multimodal Fusion of Regional Brain Experts
by: Zhuang, Farica, et al.
Published: (2025)
by: Zhuang, Farica, et al.
Published: (2025)
Textualize Visual Prompt for Image Editing via Diffusion Bridge
by: Xu, Pengcheng, et al.
Published: (2025)
by: Xu, Pengcheng, et al.
Published: (2025)
DecoratingFusion: A LiDAR-Camera Fusion Network with the Combination of Point-level and Feature-level Fusion
by: Yin, Zixuan, et al.
Published: (2024)
by: Yin, Zixuan, et al.
Published: (2024)
Semi-Supervised Variational Adversarial Active Learning via Learning to Rank and Agreement-Based Pseudo Labeling
by: Lyu, Zongyao, et al.
Published: (2024)
by: Lyu, Zongyao, et al.
Published: (2024)
Adaptive Confidence Regularization for Multimodal Failure Detection
by: Liu, Moru, et al.
Published: (2026)
by: Liu, Moru, et al.
Published: (2026)
FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation
by: Tan, Min, et al.
Published: (2026)
by: Tan, Min, et al.
Published: (2026)
Thermodynamically Optimal Regularization under Information-Geometric Constraints
by: Caraffa, Laurent
Published: (2026)
by: Caraffa, Laurent
Published: (2026)
Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models
by: Wang, Enguang, et al.
Published: (2026)
by: Wang, Enguang, et al.
Published: (2026)
MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion
by: Gu, Jihao, et al.
Published: (2025)
by: Gu, Jihao, et al.
Published: (2025)
Geometric Regularity in Deterministic Sampling Dynamics of Diffusion-based Generative Models
by: Chen, Defang, et al.
Published: (2025)
by: Chen, Defang, et al.
Published: (2025)
Invariant Representation Guided Multimodal Sentiment Decoding with Sequential Variation Regularization
by: Xu, Guoyang, et al.
Published: (2024)
by: Xu, Guoyang, et al.
Published: (2024)
A Patch-based Cross-view Regularized Framework for Backdoor Defense in Multimodal Large Language Models
by: Fang, Tianmeng, et al.
Published: (2026)
by: Fang, Tianmeng, et al.
Published: (2026)
Semantic Residual for Multimodal Unified Discrete Representation
by: Huang, Hai, et al.
Published: (2024)
by: Huang, Hai, et al.
Published: (2024)
FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens
by: Schlarmann, Christian, et al.
Published: (2025)
by: Schlarmann, Christian, et al.
Published: (2025)
Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation
by: Shen, Zhengwen, et al.
Published: (2025)
by: Shen, Zhengwen, et al.
Published: (2025)
DiRe: Diversity-promoting Regularization for Dataset Condensation
by: Mohanty, Saumyaranjan, et al.
Published: (2025)
by: Mohanty, Saumyaranjan, et al.
Published: (2025)
Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation
by: Yao, Wenfang, et al.
Published: (2024)
by: Yao, Wenfang, et al.
Published: (2024)
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
by: Ye, Qinghao, et al.
Published: (2023)
by: Ye, Qinghao, et al.
Published: (2023)
Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection
by: Shao, Qian, et al.
Published: (2024)
by: Shao, Qian, et al.
Published: (2024)
AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection
by: Chen, Zizhao, et al.
Published: (2024)
by: Chen, Zizhao, et al.
Published: (2024)
SMaRt: Improving GANs with Score Matching Regularity
by: Xia, Mengfei, et al.
Published: (2023)
by: Xia, Mengfei, et al.
Published: (2023)
HFGCN:Hypergraph Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition
by: Dong, Pengcheng, et al.
Published: (2025)
by: Dong, Pengcheng, et al.
Published: (2025)
ChronoSelect: Robust Learning with Noisy Labels via Dynamics Temporal Memory
by: Wang, Jianchao, et al.
Published: (2025)
by: Wang, Jianchao, et al.
Published: (2025)
DualSwinFusionSeg: Multimodal Martian Landslide Segmentation via Dual Swin Transformer with Multi-Scale Fusion and UNet++
by: Kabir, Shahriar, et al.
Published: (2026)
by: Kabir, Shahriar, et al.
Published: (2026)
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion
by: Wang, Zehan, et al.
Published: (2024)
by: Wang, Zehan, et al.
Published: (2024)
ProtoAda: Prototype-Guided Adaptive Adapter Expansion and Geometric Consolidation for Multimodal Continual Instruction Tuning
by: Shi, Yu-Cheng, et al.
Published: (2026)
by: Shi, Yu-Cheng, et al.
Published: (2026)
UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations
by: Mühlematter, Dominik J., et al.
Published: (2025)
by: Mühlematter, Dominik J., et al.
Published: (2025)
Beyond Simple Fusion: Adaptive Gated Fusion for Robust Multimodal Sentiment Analysis
by: Wu, Han, et al.
Published: (2025)
by: Wu, Han, et al.
Published: (2025)
Uncertainty-o: One Model-agnostic Framework for Unveiling Uncertainty in Large Multimodal Models
by: Zhang, Ruiyang, et al.
Published: (2025)
by: Zhang, Ruiyang, et al.
Published: (2025)
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving
by: Lin, R. D., et al.
Published: (2025)
by: Lin, R. D., et al.
Published: (2025)
DashFusion: Dual-stream Alignment with Hierarchical Bottleneck Fusion for Multimodal Sentiment Analysis
by: Wen, Yuhua, et al.
Published: (2025)
by: Wen, Yuhua, et al.
Published: (2025)
MER-DG: Modality-Entropy Regularization for Multimodal Domain Generalization
by: Yarici, Yavuz, et al.
Published: (2026)
by: Yarici, Yavuz, et al.
Published: (2026)
4D Multimodal Co-attention Fusion Network with Latent Contrastive Alignment for Alzheimer's Diagnosis
by: Wei, Yuxiang, et al.
Published: (2025)
by: Wei, Yuxiang, et al.
Published: (2025)
Predictive Dynamic Fusion
by: Cao, Bing, et al.
Published: (2024)
by: Cao, Bing, et al.
Published: (2024)
Hierarchical Invariance for Robust and Interpretable Vision Tasks at Larger Scales
by: Qi, Shuren, et al.
Published: (2024)
by: Qi, Shuren, et al.
Published: (2024)
Similar Items
-
COMPASS: Complete Multimodal Fusion via Proxy Tokens and Shared Spaces for Ubiquitous Sensing
by: Wang, Hao, et al.
Published: (2026) -
Purify-then-Align: Towards Robust Human Sensing under Modality Missing with Knowledge Distillation from Noisy Multimodal Teacher
by: Weng, Pengcheng, et al.
Published: (2026) -
What's on Your Plate? Inferring Chinese Cuisine Intake from Wearable IMUs
by: Yin, Jiaxi, et al.
Published: (2025) -
Automatic Recognition of Abdominal Organs in Ultrasound Images based on Deep Neural Networks and K-Nearest-Neighbor Classification
by: Li, Keyu, et al.
Published: (2021) -
Multimodal Structure Learning: Disentangling Shared and Specific Topology via Cross-Modal Graphical Lasso
by: Wang, Fei, et al.
Published: (2026)