Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Liyuan, Cheng, Zeyun, Yang, Yan, Liu, Yong, Ma, Jinke
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Multimedia
Online Access:	https://arxiv.org/abs/2507.20286
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909708705595392
author	Zhang, Liyuan Cheng, Zeyun Yang, Yan Liu, Yong Ma, Jinke
author_facet	Zhang, Liyuan Cheng, Zeyun Yang, Yan Liu, Yong Ma, Jinke
contents	The existing methods for fake news videos detection may not be generalized, because there is a distribution shift between short video news of different events, and the performance of such techniques greatly drops if news records are coming from emergencies. We propose a new fake news videos detection framework (T$^3$SVFND) using Test-Time Training (TTT) to alleviate this limitation, enhancing the robustness of fake news videos detection. Specifically, we design a self-supervised auxiliary task based on Mask Language Modeling (MLM) that masks a certain percentage of words in text and predicts these masked words by combining contextual information from different modalities (audio and video). In the test-time training phase, the model adapts to the distribution of test data through auxiliary tasks. Extensive experiments on the public benchmark demonstrate the effectiveness of the proposed model, especially for the detection of emergency news.
format	Preprint
id	arxiv_https___arxiv_org_abs_2507_20286
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	T$^\text{3}$SVFND: Towards an Evolving Fake News Detector for Emergencies with Test-time Training on Short Video Platforms Zhang, Liyuan Cheng, Zeyun Yang, Yan Liu, Yong Ma, Jinke Computer Vision and Pattern Recognition Multimedia The existing methods for fake news videos detection may not be generalized, because there is a distribution shift between short video news of different events, and the performance of such techniques greatly drops if news records are coming from emergencies. We propose a new fake news videos detection framework (T$^3$SVFND) using Test-Time Training (TTT) to alleviate this limitation, enhancing the robustness of fake news videos detection. Specifically, we design a self-supervised auxiliary task based on Mask Language Modeling (MLM) that masks a certain percentage of words in text and predicts these masked words by combining contextual information from different modalities (audio and video). In the test-time training phase, the model adapts to the distribution of test data through auxiliary tasks. Extensive experiments on the public benchmark demonstrate the effectiveness of the proposed model, especially for the detection of emergency news.
title	T$^\text{3}$SVFND: Towards an Evolving Fake News Detector for Emergencies with Test-time Training on Short Video Platforms
topic	Computer Vision and Pattern Recognition Multimedia
url	https://arxiv.org/abs/2507.20286

Similar Items