| Main Authors: | Zhang, Jiaqi, Liu, Zhuodong, Yu, Kejian |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.02441 |
Similar Items
Residual Cross-Modal Fusion Networks for Audio-Visual Navigation
by: Wang, Yi, et al.
Published: (2026)
Crop Pest Classification Using Deep Learning Techniques: A Review
by: Ejaz, Muhammad Hassam, et al.
Published: (2025)
DCVD: Dual-Channel Cross-Modal Fusion for Joint Vulnerability Detection and Localization
by: Tang, Wenxin, et al.
Published: (2026)
Motion Blur Robust Wheat Pest Damage Detection with Dynamic Fuzzy Feature Fusion
by: Zhang, Han, et al.
Published: (2026)
Multi-Scale Adaptive Neighborhood Awareness Transformer For Graph Fraud Detection
by: Lv, Jiaqi, et al.
Published: (2026)
A Depression Detection Method Based on Multi-Modal Feature Fusion Using Cross-Attention
by: Li, Shengjie, et al.
Published: (2024)
A Generalized Multi-Modal Fusion Detection Framework
by: Cui, Leichao, et al.
Published: (2023)
A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
by: Li, Shuyu, et al.
Published: (2025)
Memory-Augmented Knowledge Fusion with Safety-Aware Decoding for Domain-Adaptive Question Answering
by: Fu, Lei, et al.
Published: (2025)
FusionCast: Enhancing Precipitation Nowcasting with Asymmetric Cross-Modal Fusion and Future Radar Priors
by: Wang, Henan, et al.
Published: (2026)
Representation Learning with Mutual Influence of Modalities for Node Classification in Multi-Modal Heterogeneous Networks
by: Li, Jiafan, et al.
Published: (2025)
PosterGen: Aesthetic-Aware Multi-Modal Paper-to-Poster Generation via Multi-Agent LLMs
by: Zhang, Zhilin, et al.
Published: (2025)
MAGNet: A Multi-Scale Attention-Guided Graph Fusion Network for DRC Violation Detection
by: Lu, Weihan, et al.
Published: (2025)
CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection
by: Kang, Jincheng, et al.
Published: (2025)
Fusion-Mamba for Cross-modality Object Detection
by: Dong, Wenhao, et al.
Published: (2024)
MGHFT: Multi-Granularity Hierarchical Fusion Transformer for Cross-Modal Sticker Emotion Recognition
by: Chen, Jian, et al.
Published: (2025)
Centering Emotion Hotspots: Multimodal Local-Global Fusion and Cross-Modal Alignment for Emotion Recognition in Conversations
by: Liu, Yu, et al.
Published: (2025)
Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations
by: Kim, Jeonghyeon, et al.
Published: (2025)
Exposing Cross-Modal Consistency for Fake News Detection in Short-Form Videos
by: Tian, Chong, et al.
Published: (2026)
Cross-Modal Purification and Fusion for Small-Object RGB-D Transmission-Line Defect Detection
by: Cui, Jiaming, et al.
Published: (2026)
PestMA: LLM-based Multi-Agent System for Informed Pest Management
by: Shi, Hongrui, et al.
Published: (2025)
Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving
by: Lou, Yang, et al.
Published: (2023)
FusionFM: All-in-One Multi-Modal Image Fusion with Flow Matching
by: Zhu, Huayi, et al.
Published: (2025)
FLoRA: Fusion-Latent for Optical Reconstruction and Flood Area Segmentation via Cross-Modal Multi-Task Distillation Network
by: Talreja, Jagrati, et al.
Published: (2026)
MMSR: Symbolic Regression is a Multi-Modal Information Fusion Task
by: Li, Yanjie, et al.
Published: (2024)
Multi-Modal Sentiment Analysis with Dynamic Attention Fusion
by: Abdulhalim, Sadia, et al.
Published: (2025)
Multi-modal Data Fusion and Deep Ensemble Learning for Accurate Crop Yield Prediction
by: Yewle, Akshay Dagadu, et al.
Published: (2025)
DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling
by: Zhang, Zhihong, et al.
Published: (2026)
StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
by: Guo, Ziyu, et al.
Published: (2025)
VIFO: Visual Feature Empowered Multivariate Time Series Forecasting with Cross-Modal Fusion
by: Wang, Yanlong, et al.
Published: (2025)
Automated Plant Disease and Pest Detection System Using Hybrid Lightweight CNN-MobileViT Models for Diagnosis of Indigenous Crops
by: Gebremedhin, Tekleab G., et al.
Published: (2025)
Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection
by: Yu, Cai, et al.
Published: (2024)
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
by: Fang, Xiang, et al.
Published: (2022)
Late Fusion and Multi-Level Fission Amplify Cross-Modal Transfer in Text-Speech LMs
by: Cuervo, Santiago, et al.
Published: (2025)
Hypergraph and Latent ODE Learning for Multimodal Root Cause Localization in Microservices
by: Liu, Xin, et al.
Published: (2026)
MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection
by: Beemelmanns, Till, et al.
Published: (2024)
A Knowledge-Guided Cross-Modal Feature Fusion Model for Local Traffic Demand Prediction
by: Zhang, Lingyu, et al.
Published: (2025)
Data-driven Modality Fusion: An AI-enabled Framework for Large-Scale Sensor Network Management
by: Dutta, Hrishikesh, et al.
Published: (2025)
A Privacy-Preserving Framework with Multi-Modal Data for Cross-Domain Recommendation
by: Wang, Li, et al.
Published: (2024)
CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models
by: Zhang, Yongheng, et al.
Published: (2025)