Saved in:
Bibliographic Details
Main Authors: Li, Yangle, Luo, Danli, Hu, Haifeng
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.07430
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912753965334528
author Li, Yangle
Luo, Danli
Hu, Haifeng
author_facet Li, Yangle
Luo, Danli
Hu, Haifeng
contents Existing methods in domain generalization for Multimodal Sentiment Analysis (MSA) often overlook inter-modal synergies during invariant features extraction, which prevents the accurate capture of the rich semantic information within multimodal data. Additionally, while knowledge injection techniques have been explored in MSA, they often suffer from fragmented cross-modal knowledge, overlooking specific representations that exist beyond the confines of unimodal. To address these limitations, we propose a novel MSA framework designed for domain generalization. Firstly, the framework incorporates a Mixture of Invariant Experts model to extract domain-invariant features, thereby enhancing the model's capacity to learn synergistic relationships between modalities. Secondly, we design a Cross-Modal Adapter to augment the semantic richness of multimodal representations through cross-modal knowledge injection. Extensive domain experiments conducted on three datasets demonstrate that the proposed MIDG achieves superior performance.
format Preprint
id arxiv_https___arxiv_org_abs_2512_07430
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle MIDG: Mixture of Invariant Experts with knowledge injection for Domain Generalization in Multimodal Sentiment Analysis
Li, Yangle
Luo, Danli
Hu, Haifeng
Machine Learning
Artificial Intelligence
Existing methods in domain generalization for Multimodal Sentiment Analysis (MSA) often overlook inter-modal synergies during invariant features extraction, which prevents the accurate capture of the rich semantic information within multimodal data. Additionally, while knowledge injection techniques have been explored in MSA, they often suffer from fragmented cross-modal knowledge, overlooking specific representations that exist beyond the confines of unimodal. To address these limitations, we propose a novel MSA framework designed for domain generalization. Firstly, the framework incorporates a Mixture of Invariant Experts model to extract domain-invariant features, thereby enhancing the model's capacity to learn synergistic relationships between modalities. Secondly, we design a Cross-Modal Adapter to augment the semantic richness of multimodal representations through cross-modal knowledge injection. Extensive domain experiments conducted on three datasets demonstrate that the proposed MIDG achieves superior performance.
title MIDG: Mixture of Invariant Experts with knowledge injection for Domain Generalization in Multimodal Sentiment Analysis
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2512.07430