Bibliographic Details
Main Authors: Rawa, Jan, Sienkiewicz, Julian
Format: Preprint
Published: 2026
Subjects: Social and Information Networks
Online Access: https://arxiv.org/abs/2601.00496
_version_ 1866917416251949056
author Rawa, Jan
Sienkiewicz, Julian
author_facet Rawa, Jan
Sienkiewicz, Julian
contents Information overload (IOL) is a well-known, damaging phenomenon that degrades performance on all types of tasks. In the media space, IOL has been shown to contribute to news fatigue and news avoidance, which often lead to the proliferation of fake news posts on social networks. However, automatic methods for tracking IOL in large datasets are lacking. In this study, we investigate whether the Gini index calculated from the distribution of topics obtained via the BERTopic model can serve as a proxy for IOL. We test our assumptions on a set of Reddit communities related to the COVID-19 pandemic and obtain a significant global correlation between the Gini index and the fraction of fake news detected by the FakeBERT classifier. At the community level, however, the results of the correlation analysis are ambiguous.
format Preprint
id arxiv_https___arxiv_org_abs_2601_00496
institution arXiv
publishDate 2026
record_format arxiv
title Quantifying correlations between information overload and fake news during COVID-19 pandemic: a Reddit study with BERT model approach
topic Social and Information Networks
url https://arxiv.org/abs/2601.00496
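
The abstract proposes the Gini index of a topic distribution as a proxy for IOL. As a rough illustration only, the sketch below computes a Gini index from a list of per-post topic labels. The record does not say how the authors window the data, treat outlier topics, or normalize the index, so the function names (gini_index, topic_gini) and the integer labels are hypothetical, and no BERTopic or FakeBERT call is made here.

```python
from collections import Counter


def gini_index(counts):
    """Gini index of non-negative counts.

    0.0 means posts are spread evenly over topics; values closer to 1.0
    mean a few topics dominate. Assumption: this is the textbook formula,
    not necessarily the exact variant used in the paper.
    """
    values = sorted(counts)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # For ascending values x_1..x_n:
    # G = 2 * sum(i * x_i) / (n * sum(x_i)) - (n + 1) / n
    weighted = sum(i * x for i, x in enumerate(values, start=1))
    return 2.0 * weighted / (n * total) - (n + 1) / n


def topic_gini(topic_labels):
    """Gini index of the topic-size distribution for one batch of posts.

    topic_labels: one topic id per post (hypothetical input, for example
    the labels a topic model assigned to the posts of one community).
    """
    return gini_index(Counter(topic_labels).values())


if __name__ == "__main__":
    # Hypothetical labels: topic 0 dominates this batch of posts.
    labels = [0, 0, 0, 0, 0, 0, 1, 1, 2]
    print(topic_gini(labels))
```

One consequence of this formula is that a batch with n topics can reach at most (n - 1) / n, so comparing communities with very different topic counts may call for a normalized variant.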