| Main Authors: | Gu, Xingrui, Wang, Zhixuan, Jin, Irisa, Wu, Zekun |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.00320 |
Similar Items
VERITAS: Leveraging Vision Priors and Expert Fusion to Improve Multimodal Data
by: Xu, Tingqiao, et al.
Published: (2025)
Feature Fusion Based on Mutual-Cross-Attention Mechanism for EEG Emotion Recognition
by: Zhao, Yimin, et al.
Published: (2024)
Conceptual Belief-Informed Reinforcement Learning
by: Gu, Xingrui, et al.
Published: (2024)
MPF: Aligning and Debiasing Language Models post Deployment via Multi Perspective Fusion
by: Guan, Xin, et al.
Published: (2025)
Centering Emotion Hotspots: Multimodal Local-Global Fusion and Cross-Modal Alignment for Emotion Recognition in Conversations
by: Liu, Yu, et al.
Published: (2025)
Leveraging Label Potential for Enhanced Multimodal Emotion Recognition
by: Shao, Xuechun, et al.
Published: (2025)
SignMouth: Leveraging Mouthing Cues for Sign Language Translation by Multimodal Contrastive Fusion
by: Wu, Wenfang, et al.
Published: (2025)
Effective Instruction Parsing Plugin for Complex Logical Query Answering on Knowledge Graphs
by: Zhuo, Xingrui, et al.
Published: (2024)
Human-Centered Human-AI Interaction (HC-HAII): A Human-Centered AI Perspective
by: Xu, Wei
Published: (2025)
Sparsely Multimodal Data Fusion
by: Bjorgaard, Josiah
Published: (2024)
V2P: Visual Attention Calibration for GUI Grounding via Background Suppression and Center Peaking
by: Chen, Jikai, et al.
Published: (2026)
V2P: Visual Attention Calibration for GUI Grounding via Background Suppression and Center Peaking
by: Chen, Jikai, et al.
Published: (2025)
Multimodal Functional Maximum Correlation for Emotion Recognition
by: Zheng, Deyang, et al.
Published: (2025)
Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion
by: Li, Jiagen, et al.
Published: (2025)
Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extension
by: Wang, Ning, et al.
Published: (2024)
Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory
by: Liu, Yexiang, et al.
Published: (2025)
Stabilizing Multimodal Autoencoders: A Theoretical and Empirical Analysis of Fusion Strategies
by: Altinses, Diyar, et al.
Published: (2025)
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
by: Liu, Xiao, et al.
Published: (2024)
Dynamic Fusion-Aware Graph Convolutional Neural Network for Multimodal Emotion Recognition in Conversations
by: Meng, Tao, et al.
Published: (2026)
Rethinking Normalization Strategies and Convolutional Kernels for Multimodal Image Fusion
by: He, Dan, et al.
Published: (2024)
Knowledge Restoration-driven Prompt Optimization: Unlocking LLM Potential for Open-Domain Relational Triplet Extraction
by: Jing, Xiaonan, et al.
Published: (2026)
Sync-TVA: A Graph-Attention Framework for Multimodal Emotion Recognition with Cross-Modal Fusion
by: Deng, Zeyu, et al.
Published: (2025)
Application of Multimodal Fusion Deep Learning Model in Disease Recognition
by: Liu, Xiaoyi, et al.
Published: (2024)
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models
by: Wang, Xingrui, et al.
Published: (2025)
Appformer: A Novel Framework for Mobile App Usage Prediction Leveraging Progressive Multi-Modal Data Fusion and Feature Extraction
by: Sun, Chuike, et al.
Published: (2024)
Dual-Loop Control in DCVerse: Advancing Reliable Deployment of AI in Data Centers via Digital Twins
by: Zhang, Qingang, et al.
Published: (2026)
HEALNet: Multimodal Fusion for Heterogeneous Biomedical Data
by: Hemker, Konstantin, et al.
Published: (2023)
Leveraging Multimodal Data and Side Users for Diffusion Cross-Domain Recommendation
by: Zhang, Fan, et al.
Published: (2025)
Pedestrian Attribute Recognition via CLIP based Prompt Vision-Language Fusion
by: Wang, Xiao, et al.
Published: (2023)
Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition
by: Li, Qifei, et al.
Published: (2024)
Multimodal Fusion on Low-quality Data: A Comprehensive Survey
by: Zhang, Qingyang, et al.
Published: (2024)
CorrSteer: Generation-Time LLM Steering via Correlated Sparse Autoencoder Features
by: Cho, Seonglae, et al.
Published: (2025)
Fusion Intelligence for Digital Twinning AI Data Centers: A Synergistic GenAI-PhyAI Approach
by: Wang, Ruihang, et al.
Published: (2025)
TACFN: Transformer-based Adaptive Cross-modal Fusion Network for Multimodal Emotion Recognition
by: Liu, Feng, et al.
Published: (2025)
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration
by: Zhao, Heyang, et al.
Published: (2025)
Quantum Data Center: Perspectives
by: Liu, Junyu, et al.
Published: (2023)
Towards Effective Fusion and Forecasting of Multimodal Spatio-temporal Data for Smart Mobility
by: Wang, Chenxing
Published: (2024)
Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach
by: Alshehri, Abeer, et al.
Published: (2024)
Understanding LLM Behaviors via Compression: Data Generation, Knowledge Acquisition and Scaling Laws
by: Pan, Zhixuan, et al.
Published: (2025)
Efficient Pain Recognition via Respiration Signals: A Single Cross-Attention Transformer Multi-Window Fusion Pipeline
by: Gkikas, Stefanos, et al.
Published: (2025)