Saved in:
Bibliographic Details
Main Authors: Joshi, Swarang, Mavani, Siddharth, Alex, Joel, Negi, Arnav, Mishra, Rahul, Kumaraguru, Ponnurangam
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.15517
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914981258199040
author Joshi, Swarang
Mavani, Siddharth
Alex, Joel
Negi, Arnav
Mishra, Rahul
Kumaraguru, Ponnurangam
author_facet Joshi, Swarang
Mavani, Siddharth
Alex, Joel
Negi, Arnav
Mishra, Rahul
Kumaraguru, Ponnurangam
contents Misinformation undermines individual knowledge and affects broader societal narratives. Despite growing interest in the research community in multi-modal misinformation detection, existing methods exhibit limitations in capturing semantic cues, key regions, and cross-modal similarities within multi-modal datasets. We propose SceneGraMMi, a Scene Graph-boosted Hybrid-fusion approach for Multi-modal Misinformation veracity prediction, which integrates scene graphs across different modalities to improve detection performance. Experimental results across four benchmark datasets show that SceneGraMMi consistently outperforms state-of-the-art methods. In a comprehensive ablation study, we highlight the contribution of each component, while Shapley values are employed to examine the explainability of the model's decision-making process.
format Preprint
id arxiv_https___arxiv_org_abs_2410_15517
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle SceneGraMMi: Scene Graph-boosted Hybrid-fusion for Multi-Modal Misinformation Veracity Prediction
Joshi, Swarang
Mavani, Siddharth
Alex, Joel
Negi, Arnav
Mishra, Rahul
Kumaraguru, Ponnurangam
Computation and Language
Misinformation undermines individual knowledge and affects broader societal narratives. Despite growing interest in the research community in multi-modal misinformation detection, existing methods exhibit limitations in capturing semantic cues, key regions, and cross-modal similarities within multi-modal datasets. We propose SceneGraMMi, a Scene Graph-boosted Hybrid-fusion approach for Multi-modal Misinformation veracity prediction, which integrates scene graphs across different modalities to improve detection performance. Experimental results across four benchmark datasets show that SceneGraMMi consistently outperforms state-of-the-art methods. In a comprehensive ablation study, we highlight the contribution of each component, while Shapley values are employed to examine the explainability of the model's decision-making process.
title SceneGraMMi: Scene Graph-boosted Hybrid-fusion for Multi-Modal Misinformation Veracity Prediction
topic Computation and Language
url https://arxiv.org/abs/2410.15517