Saved in:
| Main Authors: | Xu, Tingqiao, Zeng, Ziru, Chen, Jiayu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.15317 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
by: Li, Xiaotong, et al.
Published: (2024)
by: Li, Xiaotong, et al.
Published: (2024)
VERITAS: A Unified Approach to Reliability Evaluation
by: Ramamurthy, Rajkumar, et al.
Published: (2024)
by: Ramamurthy, Rajkumar, et al.
Published: (2024)
Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification
by: Chen, Shijing, et al.
Published: (2025)
by: Chen, Shijing, et al.
Published: (2025)
Advancing Multimodal Data Fusion in Pain Recognition: A Strategy Leveraging Statistical Correlation and Human-Centered Perspectives
by: Gu, Xingrui, et al.
Published: (2024)
by: Gu, Xingrui, et al.
Published: (2024)
Enhancing Robustness of Offline Reinforcement Learning Under Data Corruption via Sharpness-Aware Minimization
by: Xu, Le, et al.
Published: (2025)
by: Xu, Le, et al.
Published: (2025)
Leveraging Mixture of Experts for Improved Speech Deepfake Detection
by: Negroni, Viola, et al.
Published: (2024)
by: Negroni, Viola, et al.
Published: (2024)
ColonScopeX: Leveraging Explainable Expert Systems with Multimodal Data for Improved Early Diagnosis of Colorectal Cancer
by: Sikora, Natalia, et al.
Published: (2025)
by: Sikora, Natalia, et al.
Published: (2025)
Sparsely Multimodal Data Fusion
by: Bjorgaard, Josiah
Published: (2024)
by: Bjorgaard, Josiah
Published: (2024)
Clustering by Attention: Leveraging Prior Fitted Transformers for Data Partitioning
by: Shokry, Ahmed, et al.
Published: (2025)
by: Shokry, Ahmed, et al.
Published: (2025)
WGRAMMAR: Leverage Prior Knowledge to Accelerate Structured Decoding
by: Wang, Ran, et al.
Published: (2025)
by: Wang, Ran, et al.
Published: (2025)
Gradient Inversion Transcript: Leveraging Robust Generative Priors to Reconstruct Training Data from Gradient Leakage
by: Chen, Xinping, et al.
Published: (2025)
by: Chen, Xinping, et al.
Published: (2025)
VERITAS: Verifying the Performance of AI-native Transceiver Actions in Base-Stations
by: Soltani, Nasim, et al.
Published: (2025)
by: Soltani, Nasim, et al.
Published: (2025)
Quid est VERITAS? A Modular Framework for Archival Document Analysis
by: Bassanini, Leonardo, et al.
Published: (2026)
by: Bassanini, Leonardo, et al.
Published: (2026)
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
by: Wilcoxson, Max, et al.
Published: (2024)
by: Wilcoxson, Max, et al.
Published: (2024)
LIDAR: Lightweight Adaptive Cue-Aware Fusion Vision Mamba for Multimodal Segmentation of Structural Cracks
by: Liu, Hui, et al.
Published: (2025)
by: Liu, Hui, et al.
Published: (2025)
DixitWorld: Evaluating Multimodal Abductive Reasoning in Vision-Language Models with Multi-Agent Dixit Gameplay
by: Mo, Yunxiang, et al.
Published: (2025)
by: Mo, Yunxiang, et al.
Published: (2025)
Leveraging Vision Capabilities of Multimodal LLMs for Automated Data Extraction from Plots
by: Polak, Maciej P., et al.
Published: (2025)
by: Polak, Maciej P., et al.
Published: (2025)
A Unified Framework for Emotion Recognition and Sentiment Analysis via Expert-Guided Multimodal Fusion with Large Language Models
by: Qiao, Jiaqi, et al.
Published: (2026)
by: Qiao, Jiaqi, et al.
Published: (2026)
ExpertGen: Scalable Sim-to-Real Expert Policy Learning from Imperfect Behavior Priors
by: Xu, Zifan, et al.
Published: (2026)
by: Xu, Zifan, et al.
Published: (2026)
MoPE: Mixture of Prompt Experts for Parameter-Efficient and Scalable Multimodal Fusion
by: Jiang, Ruixiang, et al.
Published: (2024)
by: Jiang, Ruixiang, et al.
Published: (2024)
Knowledge-Data Fusion Based Source-Free Semi-Supervised Domain Adaptation for Seizure Subtype Classification
by: Peng, Ruimin, et al.
Published: (2024)
by: Peng, Ruimin, et al.
Published: (2024)
Multimodal Fusion on Low-quality Data: A Comprehensive Survey
by: Zhang, Qingyang, et al.
Published: (2024)
by: Zhang, Qingyang, et al.
Published: (2024)
HEALNet: Multimodal Fusion for Heterogeneous Biomedical Data
by: Hemker, Konstantin, et al.
Published: (2023)
by: Hemker, Konstantin, et al.
Published: (2023)
OrdMoE: Preference Alignment via Hierarchical Expert Group Ranking in Multimodal Mixture-of-Experts LLMs
by: Gao, Yuting, et al.
Published: (2025)
by: Gao, Yuting, et al.
Published: (2025)
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data
by: Peng, Bo, et al.
Published: (2024)
by: Peng, Bo, et al.
Published: (2024)
Leveraging Multimodal Data and Side Users for Diffusion Cross-Domain Recommendation
by: Zhang, Fan, et al.
Published: (2025)
by: Zhang, Fan, et al.
Published: (2025)
Redefining Data Pairing for Motion Retargeting Leveraging a Human Body Prior
by: Figuera, Xiyana, et al.
Published: (2024)
by: Figuera, Xiyana, et al.
Published: (2024)
LUMIR: an LLM-Driven Unified Agent Framework for Multi-task Infrared Spectroscopy Reasoning
by: Xie, Zujie, et al.
Published: (2025)
by: Xie, Zujie, et al.
Published: (2025)
Interpretable Alzheimer's Diagnosis via Multimodal Fusion of Regional Brain Experts
by: Zhuang, Farica, et al.
Published: (2025)
by: Zhuang, Farica, et al.
Published: (2025)
Orchestrating Heterogeneous Experts: A Scalable MoE Framework with Anisotropy-Preserving Fusion
by: Liu, Ye, et al.
Published: (2025)
by: Liu, Ye, et al.
Published: (2025)
Mixpert: Mitigating Multimodal Learning Conflicts with Efficient Mixture-of-Vision-Experts
by: He, Xin, et al.
Published: (2025)
by: He, Xin, et al.
Published: (2025)
Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance
by: Yang, Fengze, et al.
Published: (2025)
by: Yang, Fengze, et al.
Published: (2025)
MELON: Multimodal Mixture-of-Experts with Spectral-Temporal Fusion for Long-Term Mobility Estimation in Critical Care
by: Zhang, Jiaqing, et al.
Published: (2025)
by: Zhang, Jiaqing, et al.
Published: (2025)
FusionCast: Enhancing Precipitation Nowcasting with Asymmetric Cross-Modal Fusion and Future Radar Priors
by: Wang, Henan, et al.
Published: (2026)
by: Wang, Henan, et al.
Published: (2026)
Leveraging Information Consistency in Frequency and Spatial Domain for Adversarial Attacks
by: Jin, Zhibo, et al.
Published: (2024)
by: Jin, Zhibo, et al.
Published: (2024)
SignMouth: Leveraging Mouthing Cues for Sign Language Translation by Multimodal Contrastive Fusion
by: Wu, Wenfang, et al.
Published: (2025)
by: Wu, Wenfang, et al.
Published: (2025)
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
by: Ma, Xiaoyu, et al.
Published: (2025)
by: Ma, Xiaoyu, et al.
Published: (2025)
MMSearch-Plus: Benchmarking Provenance-Aware Search for Multimodal Browsing Agents
by: Tao, Xijia, et al.
Published: (2025)
by: Tao, Xijia, et al.
Published: (2025)
AnyExperts: On-Demand Expert Allocation for Multimodal Language Models with Mixture of Expert
by: Gao, Yuting, et al.
Published: (2025)
by: Gao, Yuting, et al.
Published: (2025)
Latent Space Data Fusion Outperforms Early Fusion in Multimodal Mental Health Digital Phenotyping Data
by: Barkat, Youcef, et al.
Published: (2025)
by: Barkat, Youcef, et al.
Published: (2025)
Similar Items
-
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
by: Li, Xiaotong, et al.
Published: (2024) -
VERITAS: A Unified Approach to Reliability Evaluation
by: Ramamurthy, Rajkumar, et al.
Published: (2024) -
Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification
by: Chen, Shijing, et al.
Published: (2025) -
Advancing Multimodal Data Fusion in Pain Recognition: A Strategy Leveraging Statistical Correlation and Human-Centered Perspectives
by: Gu, Xingrui, et al.
Published: (2024) -
Enhancing Robustness of Offline Reinforcement Learning Under Data Corruption via Sharpness-Aware Minimization
by: Xu, Le, et al.
Published: (2025)