Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.15517 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866914981258199040 |
|---|---|
| author | Joshi, Swarang Mavani, Siddharth Alex, Joel Negi, Arnav Mishra, Rahul Kumaraguru, Ponnurangam |
| author_facet | Joshi, Swarang Mavani, Siddharth Alex, Joel Negi, Arnav Mishra, Rahul Kumaraguru, Ponnurangam |
| contents | Misinformation undermines individual knowledge and affects broader societal narratives. Despite growing interest in the research community in multi-modal misinformation detection, existing methods exhibit limitations in capturing semantic cues, key regions, and cross-modal similarities within multi-modal datasets. We propose SceneGraMMi, a Scene Graph-boosted Hybrid-fusion approach for Multi-modal Misinformation veracity prediction, which integrates scene graphs across different modalities to improve detection performance. Experimental results across four benchmark datasets show that SceneGraMMi consistently outperforms state-of-the-art methods. In a comprehensive ablation study, we highlight the contribution of each component, while Shapley values are employed to examine the explainability of the model's decision-making process. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2410_15517 |
| institution | arXiv |
| publishDate | 2024 |
| record_format | arxiv |
| spellingShingle | SceneGraMMi: Scene Graph-boosted Hybrid-fusion for Multi-Modal Misinformation Veracity Prediction Joshi, Swarang Mavani, Siddharth Alex, Joel Negi, Arnav Mishra, Rahul Kumaraguru, Ponnurangam Computation and Language Misinformation undermines individual knowledge and affects broader societal narratives. Despite growing interest in the research community in multi-modal misinformation detection, existing methods exhibit limitations in capturing semantic cues, key regions, and cross-modal similarities within multi-modal datasets. We propose SceneGraMMi, a Scene Graph-boosted Hybrid-fusion approach for Multi-modal Misinformation veracity prediction, which integrates scene graphs across different modalities to improve detection performance. Experimental results across four benchmark datasets show that SceneGraMMi consistently outperforms state-of-the-art methods. In a comprehensive ablation study, we highlight the contribution of each component, while Shapley values are employed to examine the explainability of the model's decision-making process. |
| title | SceneGraMMi: Scene Graph-boosted Hybrid-fusion for Multi-Modal Misinformation Veracity Prediction |
| topic | Computation and Language |
| url | https://arxiv.org/abs/2410.15517 |