Enregistré dans:
Détails bibliographiques
Auteurs principaux: Lasser, Jana, Herderich, Alina, Garland, Joshua, Aroyehun, Segun Taofeek, Garcia, David, Galesic, Mirta
Format: Preprint
Publié: 2023
Sujets:
Accès en ligne:https://arxiv.org/abs/2303.00357
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866908667676196864
author Lasser, Jana
Herderich, Alina
Garland, Joshua
Aroyehun, Segun Taofeek
Garcia, David
Galesic, Mirta
author_facet Lasser, Jana
Herderich, Alina
Garland, Joshua
Aroyehun, Segun Taofeek
Garcia, David
Galesic, Mirta
contents In the digital age, hate speech poses a threat to the functioning of social media platforms as spaces for public discourse. Top-down approaches to moderate hate speech encounter difficulties due to conflicts with freedom of expression and issues of scalability. Counter speech, a form of collective moderation by citizens, has emerged as a potential remedy. Here, we aim to investigate which counter speech strategies are most effective in reducing the prevalence of hate, toxicity, and extremity on online platforms. We analyze more than 130,000 discussions on German Twitter starting at the peak of the migrant crisis in 2015 and extending over four years. We use human annotation and machine learning classifiers to identify argumentation strategies, ingroup and outgroup references, emotional tone, and different measures of discourse quality. Using matching and time-series analyses we discern the effectiveness of naturally observed counter speech strategies on the micro-level (individual tweet pairs), meso-level (entire discussions) and macro-level (over days). We find that expressing straightforward opinions, even if not factual but devoid of insults, results in the least subsequent hate, toxicity, and extremity over all levels of analyses. This strategy complements currently recommended counter speech strategies and is easy for citizens to engage in. Sarcasm can also be effective in improving discourse quality, especially in the presence of organized extreme groups. Going beyond one-shot analyses on smaller samples prevalent in most prior studies, our findings have implications for the successful management of public online spaces through collective civic moderation.
format Preprint
id arxiv_https___arxiv_org_abs_2303_00357
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Collective moderation of hate, toxicity, and extremity in online discussions
Lasser, Jana
Herderich, Alina
Garland, Joshua
Aroyehun, Segun Taofeek
Garcia, David
Galesic, Mirta
Computers and Society
In the digital age, hate speech poses a threat to the functioning of social media platforms as spaces for public discourse. Top-down approaches to moderate hate speech encounter difficulties due to conflicts with freedom of expression and issues of scalability. Counter speech, a form of collective moderation by citizens, has emerged as a potential remedy. Here, we aim to investigate which counter speech strategies are most effective in reducing the prevalence of hate, toxicity, and extremity on online platforms. We analyze more than 130,000 discussions on German Twitter starting at the peak of the migrant crisis in 2015 and extending over four years. We use human annotation and machine learning classifiers to identify argumentation strategies, ingroup and outgroup references, emotional tone, and different measures of discourse quality. Using matching and time-series analyses we discern the effectiveness of naturally observed counter speech strategies on the micro-level (individual tweet pairs), meso-level (entire discussions) and macro-level (over days). We find that expressing straightforward opinions, even if not factual but devoid of insults, results in the least subsequent hate, toxicity, and extremity over all levels of analyses. This strategy complements currently recommended counter speech strategies and is easy for citizens to engage in. Sarcasm can also be effective in improving discourse quality, especially in the presence of organized extreme groups. Going beyond one-shot analyses on smaller samples prevalent in most prior studies, our findings have implications for the successful management of public online spaces through collective civic moderation.
title Collective moderation of hate, toxicity, and extremity in online discussions
topic Computers and Society
url https://arxiv.org/abs/2303.00357