Library Catalog

Bibliographic Details
Main Authors:	Bonaldi, Helena; Chung, Yi-Ling; Abercrombie, Gavin; Guerini, Marco
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2403.20103

Similar Items

Assisted Counterspeech Writing at the Crossroads of Hate Speech and Misinformation
by: Martone, Genoveffa, et al.
Published: (2026)

Can NLP Tackle Hate Speech in the Real World? Stakeholder-Informed Feedback and Survey on Counterspeech
by: Dinkar, Tanvi, et al.
Published: (2025)

Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
by: Bonaldi, Helena, et al.
Published: (2024)

Multilingual Hate Speech Detection and Counterspeech Generation: A Comprehensive Survey and Practical Guide
by: Fesaghandis, Zahra Safdari, et al.
Published: (2026)

Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection
by: Curry, Amanda Cercas, et al.
Published: (2024)

Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation
by: Bengoetxea, Jaione, et al.
Published: (2024)

Assessing How Hate, Counterspeech, and Toxicity Affect Hate Group Newcomers
by: Hickey, Daniel, et al.
Published: (2024)

Web(er) of Hate: A Survey on How Hate Speech Is Typed
by: Wang, Luna, et al.
Published: (2025)

Erasing 'Ugly' from the Internet: Propagation of the Beauty Myth in Text-Image Models
by: Dinkar, Tanvi, et al.
Published: (2025)

Food Noise & False Safety: A Systematic Evaluation of How LLMs Fail to Adapt to Eating Disorder Queries with Clinician Feedback
by: Pucci, Giulia, et al.
Published: (2026)

NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps
by: Gligoric, Kristina, et al.
Published: (2024)

Counterspeech the ultimate shield! Multi-Conditioned Counterspeech Generation through Attributed Prefix Learning
by: Kumar, Aswini, et al.
Published: (2025)

Re-examining Sexism and Misogyny Classification with Annotator Attitudes
by: Jiang, Aiqi, et al.
Published: (2024)

HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation
by: Rizwan, Naquee, et al.
Published: (2025)

CODEOFCONDUCT at Multilingual Counterspeech Generation: A Context-Aware Model for Robust Counterspeech Generation in Low-Resource Languages
by: Bennie, Michael, et al.
Published: (2025)

Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement
by: Abercrombie, Gavin, et al.
Published: (2023)

On Zero-Shot Counterspeech Generation by LLMs
by: Saha, Punyajoy, et al.
Published: (2024)

Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language
by: Saffari, Hamidreza, et al.
Published: (2024)

Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories
by: Lisker, Mareike, et al.
Published: (2025)

Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate
by: Ngueajio, Mikel K., et al.
Published: (2025)

LLMberjack: Guided Trimming of Debate Trees for Multi-Party Conversation Creation
by: Bottona, Leonardo, et al.
Published: (2026)

Assessing the Human Likeness of AI-Generated Counterspeech
by: Song, Xiaoying, et al.
Published: (2024)

Echoes of Discord: Forecasting Hater Reactions to Counterspeech
by: Song, Xiaoying, et al.
Published: (2025)

When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation
by: Occhipinti, Daniela, et al.
Published: (2025)

Algorithmic Fairness in NLP: Persona-Infused LLMs for Human-Centric Hate Speech Detection
by: Gajewska, Ewelina, et al.
Published: (2025)

Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
by: Plaza-del-Arco, Flor Miriam, et al.
Published: (2024)

HateModerate: Testing Hate Speech Detectors against Content Moderation Policies
by: Zheng, Jiangrui, et al.
Published: (2023)

PRODIGy: a PROfile-based DIalogue Generation dataset
by: Occhipinti, Daniela, et al.
Published: (2023)

A Comprehensive Study on NLP Data Augmentation for Hate Speech Detection: Legacy Methods, BERT, and LLMs
by: Jahan, Md Saroar, et al.
Published: (2024)

Face the Facts! Evaluating RAG-based Pipelines for Professional Fact-Checking
by: Russo, Daniel, et al.
Published: (2024)

NLP for The Greek Language: A Longer Survey
by: Papantoniou, Katerina, et al.
Published: (2024)

Low-Resource Counterspeech Generation for Indic Languages: The Case of Bengali and Hindi
by: Das, Mithun, et al.
Published: (2024)

State of NLP in Kenya: A Survey
by: Amol, Cynthia Jayne, et al.
Published: (2024)

Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations
by: Penzo, Nicolò, et al.
Published: (2024)

Towards Faithful Model Explanation in NLP: A Survey
by: Lyu, Qing, et al.
Published: (2022)

A Survey of Cognitive Distortion Detection and Classification in NLP
by: Sage, Archie, et al.
Published: (2025)

Speaking at the Right Level: Literacy-Controlled Counterspeech Generation with RAG-RL
by: Song, Xiaoying, et al.
Published: (2025)

CrisiText: A dataset of warning messages for LLM training in emergency communication
by: Gonella, Giacomo, et al.
Published: (2025)

Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
by: Anik, Anirban Saha, et al.
Published: (2025)

NLP Privacy Risk Identification in Social Media (NLP-PRISM): A Survey
by: Goswami, Dhiman, et al.
Published: (2026)