Saved in:
Bibliographic Details
Main Authors: Wang, Yufei, Wu, Mengyue
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.03004
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929374785175552
author Wang, Yufei
Wu, Mengyue
author_facet Wang, Yufei
Wu, Mengyue
contents Emotion semantic inconsistency is an ubiquitous challenge in multi-modal sentiment analysis (MSA). MSA involves analyzing sentiment expressed across various modalities like text, audio, and videos. Each modality may convey distinct aspects of sentiment, due to subtle and nuanced expression of human beings, leading to inconsistency, which may hinder the prediction of artificial agents. In this work, we introduce a modality conflicting test set and assess the performance of both traditional multi-modal sentiment analysis models and multi-modal large language models (MLLMs). Our findings reveal significant performance degradation across traditional models when confronted with semantically conflicting data and point out the drawbacks of MLLMs when handling multi-modal emotion analysis. Our research presents a new challenge and offer valuable insights for the future development of sentiment analysis systems.
format Preprint
id arxiv_https___arxiv_org_abs_2406_03004
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Evaluation of data inconsistency for multi-modal sentiment analysis
Wang, Yufei
Wu, Mengyue
Computation and Language
Artificial Intelligence
Emotion semantic inconsistency is an ubiquitous challenge in multi-modal sentiment analysis (MSA). MSA involves analyzing sentiment expressed across various modalities like text, audio, and videos. Each modality may convey distinct aspects of sentiment, due to subtle and nuanced expression of human beings, leading to inconsistency, which may hinder the prediction of artificial agents. In this work, we introduce a modality conflicting test set and assess the performance of both traditional multi-modal sentiment analysis models and multi-modal large language models (MLLMs). Our findings reveal significant performance degradation across traditional models when confronted with semantically conflicting data and point out the drawbacks of MLLMs when handling multi-modal emotion analysis. Our research presents a new challenge and offer valuable insights for the future development of sentiment analysis systems.
title Evaluation of data inconsistency for multi-modal sentiment analysis
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2406.03004