Saved in:
Bibliographic Details
Main Authors: Zhang, Liyuan, Cheng, Zeyun, Yang, Yan, Liu, Yong, Ma, Jinke
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.20286
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909708705595392
author Zhang, Liyuan
Cheng, Zeyun
Yang, Yan
Liu, Yong
Ma, Jinke
author_facet Zhang, Liyuan
Cheng, Zeyun
Yang, Yan
Liu, Yong
Ma, Jinke
contents The existing methods for fake news videos detection may not be generalized, because there is a distribution shift between short video news of different events, and the performance of such techniques greatly drops if news records are coming from emergencies. We propose a new fake news videos detection framework (T$^3$SVFND) using Test-Time Training (TTT) to alleviate this limitation, enhancing the robustness of fake news videos detection. Specifically, we design a self-supervised auxiliary task based on Mask Language Modeling (MLM) that masks a certain percentage of words in text and predicts these masked words by combining contextual information from different modalities (audio and video). In the test-time training phase, the model adapts to the distribution of test data through auxiliary tasks. Extensive experiments on the public benchmark demonstrate the effectiveness of the proposed model, especially for the detection of emergency news.
format Preprint
id arxiv_https___arxiv_org_abs_2507_20286
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle T$^\text{3}$SVFND: Towards an Evolving Fake News Detector for Emergencies with Test-time Training on Short Video Platforms
Zhang, Liyuan
Cheng, Zeyun
Yang, Yan
Liu, Yong
Ma, Jinke
Computer Vision and Pattern Recognition
Multimedia
The existing methods for fake news videos detection may not be generalized, because there is a distribution shift between short video news of different events, and the performance of such techniques greatly drops if news records are coming from emergencies. We propose a new fake news videos detection framework (T$^3$SVFND) using Test-Time Training (TTT) to alleviate this limitation, enhancing the robustness of fake news videos detection. Specifically, we design a self-supervised auxiliary task based on Mask Language Modeling (MLM) that masks a certain percentage of words in text and predicts these masked words by combining contextual information from different modalities (audio and video). In the test-time training phase, the model adapts to the distribution of test data through auxiliary tasks. Extensive experiments on the public benchmark demonstrate the effectiveness of the proposed model, especially for the detection of emergency news.
title T$^\text{3}$SVFND: Towards an Evolving Fake News Detector for Emergencies with Test-time Training on Short Video Platforms
topic Computer Vision and Pattern Recognition
Multimedia
url https://arxiv.org/abs/2507.20286