Saved in:
Bibliographic Details
Main Authors: Yu, Jianxing, Wang, Shiqi, Yin, Han, Sun, Zhenlong, Xie, Ruobing, Zhang, Bo, Rao, Yanghui
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.07673
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917799960510464
author Yu, Jianxing
Wang, Shiqi
Yin, Han
Sun, Zhenlong
Xie, Ruobing
Zhang, Bo
Rao, Yanghui
author_facet Yu, Jianxing
Wang, Shiqi
Yin, Han
Sun, Zhenlong
Xie, Ruobing
Zhang, Bo
Rao, Yanghui
contents This paper focuses on detecting clickbait posts on the Web. These posts often use eye-catching disinformation in mixed modalities to mislead users to click for profit. That affects the user experience and thus would be blocked by content provider. To escape detection, malicious creators use tricks to add some irrelevant non-bait content into bait posts, dressing them up as legal to fool the detector. This content often has biased relations with non-bait labels, yet traditional detectors tend to make predictions based on simple co-occurrence rather than grasping inherent factors that lead to malicious behavior. This spurious bias would easily cause misjudgments. To address this problem, we propose a new debiased method based on causal inference. We first employ a set of features in multiple modalities to characterize the posts. Considering these features are often mixed up with unknown biases, we then disentangle three kinds of latent factors from them, including the invariant factor that indicates intrinsic bait intention; the causal factor which reflects deceptive patterns in a certain scenario, and non-causal noise. By eliminating the noise that causes bias, we can use invariant and causal factors to build a robust model with good generalization ability. Experiments on three popular datasets show the effectiveness of our approach.
format Preprint
id arxiv_https___arxiv_org_abs_2410_07673
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Yu, Jianxing
Wang, Shiqi
Yin, Han
Sun, Zhenlong
Xie, Ruobing
Zhang, Bo
Rao, Yanghui
Machine Learning
Artificial Intelligence
This paper focuses on detecting clickbait posts on the Web. These posts often use eye-catching disinformation in mixed modalities to mislead users to click for profit. That affects the user experience and thus would be blocked by content provider. To escape detection, malicious creators use tricks to add some irrelevant non-bait content into bait posts, dressing them up as legal to fool the detector. This content often has biased relations with non-bait labels, yet traditional detectors tend to make predictions based on simple co-occurrence rather than grasping inherent factors that lead to malicious behavior. This spurious bias would easily cause misjudgments. To address this problem, we propose a new debiased method based on causal inference. We first employ a set of features in multiple modalities to characterize the posts. Considering these features are often mixed up with unknown biases, we then disentangle three kinds of latent factors from them, including the invariant factor that indicates intrinsic bait intention; the causal factor which reflects deceptive patterns in a certain scenario, and non-causal noise. By eliminating the noise that causes bias, we can use invariant and causal factors to build a robust model with good generalization ability. Experiments on three popular datasets show the effectiveness of our approach.
title Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2410.07673