Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yu, Jianxing, Wang, Shiqi, Yin, Han, Sun, Zhenlong, Xie, Ruobing, Zhang, Bo, Rao, Yanghui
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.07673
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917799960510464
author	Yu, Jianxing Wang, Shiqi Yin, Han Sun, Zhenlong Xie, Ruobing Zhang, Bo Rao, Yanghui
author_facet	Yu, Jianxing Wang, Shiqi Yin, Han Sun, Zhenlong Xie, Ruobing Zhang, Bo Rao, Yanghui
contents	This paper focuses on detecting clickbait posts on the Web. These posts often use eye-catching disinformation in mixed modalities to mislead users to click for profit. That affects the user experience and thus would be blocked by content provider. To escape detection, malicious creators use tricks to add some irrelevant non-bait content into bait posts, dressing them up as legal to fool the detector. This content often has biased relations with non-bait labels, yet traditional detectors tend to make predictions based on simple co-occurrence rather than grasping inherent factors that lead to malicious behavior. This spurious bias would easily cause misjudgments. To address this problem, we propose a new debiased method based on causal inference. We first employ a set of features in multiple modalities to characterize the posts. Considering these features are often mixed up with unknown biases, we then disentangle three kinds of latent factors from them, including the invariant factor that indicates intrinsic bait intention; the causal factor which reflects deceptive patterns in a certain scenario, and non-causal noise. By eliminating the noise that causes bias, we can use invariant and causal factors to build a robust model with good generalization ability. Experiments on three popular datasets show the effectiveness of our approach.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_07673
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference Yu, Jianxing Wang, Shiqi Yin, Han Sun, Zhenlong Xie, Ruobing Zhang, Bo Rao, Yanghui Machine Learning Artificial Intelligence This paper focuses on detecting clickbait posts on the Web. These posts often use eye-catching disinformation in mixed modalities to mislead users to click for profit. That affects the user experience and thus would be blocked by content provider. To escape detection, malicious creators use tricks to add some irrelevant non-bait content into bait posts, dressing them up as legal to fool the detector. This content often has biased relations with non-bait labels, yet traditional detectors tend to make predictions based on simple co-occurrence rather than grasping inherent factors that lead to malicious behavior. This spurious bias would easily cause misjudgments. To address this problem, we propose a new debiased method based on causal inference. We first employ a set of features in multiple modalities to characterize the posts. Considering these features are often mixed up with unknown biases, we then disentangle three kinds of latent factors from them, including the invariant factor that indicates intrinsic bait intention; the causal factor which reflects deceptive patterns in a certain scenario, and non-causal noise. By eliminating the noise that causes bias, we can use invariant and causal factors to build a robust model with good generalization ability. Experiments on three popular datasets show the effectiveness of our approach.
title	Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2410.07673

Similar Items