Bibliographic Details
Main Authors: Lin, Guangyu, Lin, Li, Walker, Christina P., Schiff, Daniel S., Hu, Shu
Format: Preprint
Published: 2025
Subjects: Computer Vision and Pattern Recognition
Online Access: https://arxiv.org/abs/2510.16556
_version_ 1866912677913165824
author Lin, Guangyu
Lin, Li
Walker, Christina P.
Schiff, Daniel S.
Hu, Shu
contents The rapid proliferation of AI-generated content, driven by advances in generative adversarial networks, diffusion models, and multimodal large language models, has made the creation and dissemination of synthetic media effortless, heightening the risks of misinformation, particularly political deepfakes that distort truth and undermine trust in political institutions. In turn, governments, research institutions, and industry have strongly promoted deepfake detection initiatives as solutions. Yet, most existing models are trained and validated on synthetic, laboratory-controlled datasets, limiting their generalizability to the kinds of real-world political deepfakes circulating on social platforms that affect the public. In this work, we introduce the first systematic benchmark based on the Political Deepfakes Incident Database, a curated collection of real-world political deepfakes shared on social media since 2018. Our study includes a systematic evaluation of state-of-the-art deepfake detectors across academia, government, and industry. We find that the detectors from academia and government perform relatively poorly. While paid detection tools achieve relatively higher performance than free-access models, all evaluated detectors struggle to generalize effectively to authentic political deepfakes, and are vulnerable to simple manipulations, especially in the video domain. Results urge the need for politically contextualized deepfake detection frameworks to better safeguard the public in real-world settings.
format Preprint
id arxiv_https___arxiv_org_abs_2510_16556
institution arXiv
publishDate 2025
record_format arxiv
title Fit for Purpose? Deepfake Detection in the Real World
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2510.16556