| Main Authors: | Ouyang, Xueqiang, Wei, Jia, Huo, Wenjie, Wang, Xiaocong, Li, Rui, Zhou, Jianlong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.04353 |
Similar Items
MolFM-Lite: Multi-Modal Molecular Property Prediction with Conformer Ensemble Attention and Cross-Modal Fusion
by: Shah, Syed Omer, et al.
Published: (2026)
DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency
by: Yao, Wenfang, et al.
Published: (2024)
DecoratingFusion: A LiDAR-Camera Fusion Network with the Combination of Point-level and Feature-level Fusion
by: Yin, Zixuan, et al.
Published: (2024)
Multi-Modal Sensor Fusion using Hybrid Attention for Autonomous Driving
by: Mayank, Mayank, et al.
Published: (2026)
Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion
by: Cho, Minkyoung, et al.
Published: (2024)
Predictive Dynamic Fusion
by: Cao, Bing, et al.
Published: (2024)
Tactile Modality Fusion for Vision-Language-Action Models
by: Morissette, Charlotte, et al.
Published: (2026)
CXR-TFT: Multi-Modal Temporal Fusion Transformer for Predicting Chest X-ray Trajectories
by: Arora, Mehak, et al.
Published: (2025)
Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion
by: Ding, Dexuan, et al.
Published: (2024)
Deep Learning-Based Multi-Modal Fusion for Robust Robot Perception and Navigation
by: Lai, Delun, et al.
Published: (2025)
SocialMOIF: Multi-Order Intention Fusion for Pedestrian Trajectory Prediction
by: Chen, Kai, et al.
Published: (2025)
Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models
by: Srinivas, Sakhinana Sagar, et al.
Published: (2024)
Multi-Modal Fusion of In-Situ Video Data and Process Parameters for Online Forecasting of Cookie Drying Readiness
by: Li, Shichen, et al.
Published: (2025)
Modality-Aware Infrared and Visible Image Fusion with Target-Aware Supervision
by: Sun, Tianyao, et al.
Published: (2025)
Rethinking Few-Shot Image Fusion: Granular Ball Priors Enable General-Purpose Deep Fusion
by: Deng, Minjie, et al.
Published: (2025)
BOFA: Bridge-Layer Orthogonal Low-Rank Fusion for CLIP-Based Class-Incremental Learning
by: Li, Lan, et al.
Published: (2025)
TrajFusionNet: Pedestrian Crossing Intention Prediction via Fusion of Sequential and Visual Trajectory Representations
by: Landry, François G., et al.
Published: (2025)
RAFNet: Region-Aware Fusion Network for Pansharpening
by: Zhang, Jianing, et al.
Published: (2026)
DashFusion: Dual-stream Alignment with Hierarchical Bottleneck Fusion for Multimodal Sentiment Analysis
by: Wen, Yuhua, et al.
Published: (2025)
Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network
by: Yin, Hang, et al.
Published: (2025)
Compact Twice Fusion Network for Edge Detection
by: Li, Yachuan, et al.
Published: (2023)
MCFCN: Multi-View Clustering via a Fusion-Consensus Graph Convolutional Network
by: Pei, Chenping, et al.
Published: (2025)
WTTFNet: A Weather-Time-Trajectory Fusion Network for Pedestrian Trajectory Prediction in Urban Complex
by: Wu, Ho Chun, et al.
Published: (2024)
ACD-CLIP: Decoupling Representation and Dynamic Fusion for Zero-Shot Anomaly Detection
by: Ma, Ke, et al.
Published: (2025)
Improving Robustness of LiDAR-Camera Fusion Model against Weather Corruption from Fusion Strategy Perspective
by: Huang, Yihao, et al.
Published: (2024)
TripleSumm: Adaptive Triple-Modality Fusion for Video Summarization
by: Kim, Sumin, et al.
Published: (2026)
DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion
by: Xu, Jian, et al.
Published: (2024)
DPL: Decoupled Prototype Learning for Enhancing Robustness of Vision-Language Transformers to Missing Modalities
by: Lu, Jueqing, et al.
Published: (2025)
Wolf2Pack: The AutoFusion Framework for Dynamic Parameter Fusion
by: Tian, Bowen, et al.
Published: (2024)
DualSwinFusionSeg: Multimodal Martian Landslide Segmentation via Dual Swin Transformer with Multi-Scale Fusion and UNet++
by: Kabir, Shahriar, et al.
Published: (2026)
MHSA: A Multi-scale Hypergraph Network for Mild Cognitive Impairment Detection via Synchronous and Attentive Fusion
by: Yuan, Manman, et al.
Published: (2024)
Applying Graph Explanation to Operator Fusion
by: Mills, Keith G., et al.
Published: (2024)
Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection
by: Fan, Yunfeng, et al.
Published: (2023)
MultiFusionNet: Multilayer Multimodal Fusion of Deep Neural Networks for Chest X-Ray Image Classification
by: Agarwal, Saurabh, et al.
Published: (2024)
Multimodal Fusion of Glucose Monitoring and Food Imagery for Caloric Content Prediction
by: Kumar, Adarsh
Published: (2025)
DyCAF-Net: Dynamic Class-Aware Fusion Network
by: Jahin, Md Abrar, et al.
Published: (2025)
Audio-Visual Compound Expression Recognition Method based on Late Modality Fusion and Rule-based Decision
by: Ryumina, Elena, et al.
Published: (2024)
UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations
by: Mühlematter, Dominik J., et al.
Published: (2025)
Beyond Simple Fusion: Adaptive Gated Fusion for Robust Multimodal Sentiment Analysis
by: Wu, Han, et al.
Published: (2025)
Spiking Neural Network Feature Discrimination Boosts Modality Fusion
by: Oikonomou, Katerina Maria, et al.
Published: (2025)