Saved in:
Bibliographic Details
Main Authors: Shamsabad, Marzieh Adeli, Ghodrati, Hamed
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.16108
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915747822829568
author Shamsabad, Marzieh Adeli
Ghodrati, Hamed
author_facet Shamsabad, Marzieh Adeli
Ghodrati, Hamed
contents Climate disinformation has become a major challenge in today digital world, especially with the rise of misleading images and videos shared widely on social media. These false claims are often convincing and difficult to detect, which can delay actions on climate change. While vision-language models (VLMs) have been used to identify visual disinformation, they rely only on the knowledge available at the time of training. This limits their ability to reason about recent events or updates. The main goal of this paper is to overcome that limitation by combining VLMs with external knowledge. By retrieving up-to-date information such as reverse image results, online fact-checks, and trusted expert content, the system can better assess whether an image and its claim are accurate, misleading, false, or unverifiable. This approach improves the model ability to handle real-world climate disinformation and supports efforts to protect public understanding of science in a rapidly changing information landscape.
format Preprint
id arxiv_https___arxiv_org_abs_2601_16108
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Multimodal Climate Disinformation Detection: Integrating Vision-Language Models with External Knowledge Sources
Shamsabad, Marzieh Adeli
Ghodrati, Hamed
Artificial Intelligence
Climate disinformation has become a major challenge in today digital world, especially with the rise of misleading images and videos shared widely on social media. These false claims are often convincing and difficult to detect, which can delay actions on climate change. While vision-language models (VLMs) have been used to identify visual disinformation, they rely only on the knowledge available at the time of training. This limits their ability to reason about recent events or updates. The main goal of this paper is to overcome that limitation by combining VLMs with external knowledge. By retrieving up-to-date information such as reverse image results, online fact-checks, and trusted expert content, the system can better assess whether an image and its claim are accurate, misleading, false, or unverifiable. This approach improves the model ability to handle real-world climate disinformation and supports efforts to protect public understanding of science in a rapidly changing information landscape.
title Multimodal Climate Disinformation Detection: Integrating Vision-Language Models with External Knowledge Sources
topic Artificial Intelligence
url https://arxiv.org/abs/2601.16108