Saved in:
| Main Authors: | Bonaldi, Helena, Chung, Yi-Ling, Abercrombie, Gavin, Guerini, Marco |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.20103 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Assisted Counterspeech Writing at the Crossroads of Hate Speech and Misinformation
by: Martone, Genoveffa, et al.
Published: (2026)
by: Martone, Genoveffa, et al.
Published: (2026)
Can NLP Tackle Hate Speech in the Real World? Stakeholder-Informed Feedback and Survey on Counterspeech
by: Dinkar, Tanvi, et al.
Published: (2025)
by: Dinkar, Tanvi, et al.
Published: (2025)
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
by: Bonaldi, Helena, et al.
Published: (2024)
by: Bonaldi, Helena, et al.
Published: (2024)
Multilingual Hate Speech Detection and Counterspeech Generation: A Comprehensive Survey and Practical Guide
by: Fesaghandis, Zahra Safdari, et al.
Published: (2026)
by: Fesaghandis, Zahra Safdari, et al.
Published: (2026)
Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection
by: Curry, Amanda Cercas, et al.
Published: (2024)
by: Curry, Amanda Cercas, et al.
Published: (2024)
Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation
by: Bengoetxea, Jaione, et al.
Published: (2024)
by: Bengoetxea, Jaione, et al.
Published: (2024)
Assessing How Hate, Counterspeech, and Toxicity Affect Hate Group Newcomers
by: Hickey, Daniel, et al.
Published: (2024)
by: Hickey, Daniel, et al.
Published: (2024)
Web(er) of Hate: A Survey on How Hate Speech Is Typed
by: Wang, Luna, et al.
Published: (2025)
by: Wang, Luna, et al.
Published: (2025)
Erasing 'Ugly' from the Internet: Propagation of the Beauty Myth in Text-Image Models
by: Dinkar, Tanvi, et al.
Published: (2025)
by: Dinkar, Tanvi, et al.
Published: (2025)
Food Noise & False Safety: A Systematic Evaluation of How LLMs Fail to Adapt to Eating Disorder Queries with Clinician Feedback
by: Pucci, Giulia, et al.
Published: (2026)
by: Pucci, Giulia, et al.
Published: (2026)
NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps
by: Gligoric, Kristina, et al.
Published: (2024)
by: Gligoric, Kristina, et al.
Published: (2024)
Counterspeech the ultimate shield! Multi-Conditioned Counterspeech Generation through Attributed Prefix Learning
by: Kumar, Aswini, et al.
Published: (2025)
by: Kumar, Aswini, et al.
Published: (2025)
Re-examining Sexism and Misogyny Classification with Annotator Attitudes
by: Jiang, Aiqi, et al.
Published: (2024)
by: Jiang, Aiqi, et al.
Published: (2024)
HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation
by: Rizwan, Naquee, et al.
Published: (2025)
by: Rizwan, Naquee, et al.
Published: (2025)
CODEOFCONDUCT at Multilingual Counterspeech Generation: A Context-Aware Model for Robust Counterspeech Generation in Low-Resource Languages
by: Bennie, Michael, et al.
Published: (2025)
by: Bennie, Michael, et al.
Published: (2025)
Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement
by: Abercrombie, Gavin, et al.
Published: (2023)
by: Abercrombie, Gavin, et al.
Published: (2023)
On Zero-Shot Counterspeech Generation by LLMs
by: Saha, Punyajoy, et al.
Published: (2024)
by: Saha, Punyajoy, et al.
Published: (2024)
Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language
by: Saffari, Hamidreza, et al.
Published: (2024)
by: Saffari, Hamidreza, et al.
Published: (2024)
Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories
by: Lisker, Mareike, et al.
Published: (2025)
by: Lisker, Mareike, et al.
Published: (2025)
Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate
by: Ngueajio, Mikel K., et al.
Published: (2025)
by: Ngueajio, Mikel K., et al.
Published: (2025)
LLMberjack: Guided Trimming of Debate Trees for Multi-Party Conversation Creation
by: Bottona, Leonardo, et al.
Published: (2026)
by: Bottona, Leonardo, et al.
Published: (2026)
Assessing the Human Likeness of AI-Generated Counterspeech
by: Song, Xiaoying, et al.
Published: (2024)
by: Song, Xiaoying, et al.
Published: (2024)
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
by: Song, Xiaoying, et al.
Published: (2025)
by: Song, Xiaoying, et al.
Published: (2025)
When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation
by: Occhipinti, Daniela, et al.
Published: (2025)
by: Occhipinti, Daniela, et al.
Published: (2025)
Algorithmic Fairness in NLP: Persona-Infused LLMs for Human-Centric Hate Speech Detection
by: Gajewska, Ewelina, et al.
Published: (2025)
by: Gajewska, Ewelina, et al.
Published: (2025)
Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
by: Plaza-del-Arco, Flor Miriam, et al.
Published: (2024)
by: Plaza-del-Arco, Flor Miriam, et al.
Published: (2024)
HateModerate: Testing Hate Speech Detectors against Content Moderation Policies
by: Zheng, Jiangrui, et al.
Published: (2023)
by: Zheng, Jiangrui, et al.
Published: (2023)
PRODIGy: a PROfile-based DIalogue Generation dataset
by: Occhipinti, Daniela, et al.
Published: (2023)
by: Occhipinti, Daniela, et al.
Published: (2023)
A Comprehensive Study on NLP Data Augmentation for Hate Speech Detection: Legacy Methods, BERT, and LLMs
by: Jahan, Md Saroar, et al.
Published: (2024)
by: Jahan, Md Saroar, et al.
Published: (2024)
Face the Facts! Evaluating RAG-based Pipelines for Professional Fact-Checking
by: Russo, Daniel, et al.
Published: (2024)
by: Russo, Daniel, et al.
Published: (2024)
NLP for The Greek Language: A Longer Survey
by: Papantoniou, Katerina, et al.
Published: (2024)
by: Papantoniou, Katerina, et al.
Published: (2024)
Low-Resource Counterspeech Generation for Indic Languages: The Case of Bengali and Hindi
by: Das, Mithun, et al.
Published: (2024)
by: Das, Mithun, et al.
Published: (2024)
State of NLP in Kenya: A Survey
by: Amol, Cynthia Jayne, et al.
Published: (2024)
by: Amol, Cynthia Jayne, et al.
Published: (2024)
Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations
by: Penzo, Nicolò, et al.
Published: (2024)
by: Penzo, Nicolò, et al.
Published: (2024)
Towards Faithful Model Explanation in NLP: A Survey
by: Lyu, Qing, et al.
Published: (2022)
by: Lyu, Qing, et al.
Published: (2022)
A Survey of Cognitive Distortion Detection and Classification in NLP
by: Sage, Archie, et al.
Published: (2025)
by: Sage, Archie, et al.
Published: (2025)
Speaking at the Right Level: Literacy-Controlled Counterspeech Generation with RAG-RL
by: Song, Xiaoying, et al.
Published: (2025)
by: Song, Xiaoying, et al.
Published: (2025)
CrisiText: A dataset of warning messages for LLM training in emergency communication
by: Gonella, Giacomo, et al.
Published: (2025)
by: Gonella, Giacomo, et al.
Published: (2025)
Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
by: Anik, Anirban Saha, et al.
Published: (2025)
by: Anik, Anirban Saha, et al.
Published: (2025)
NLP Privacy Risk Identification in Social Media (NLP-PRISM): A Survey
by: Goswami, Dhiman, et al.
Published: (2026)
by: Goswami, Dhiman, et al.
Published: (2026)
Similar Items
-
Assisted Counterspeech Writing at the Crossroads of Hate Speech and Misinformation
by: Martone, Genoveffa, et al.
Published: (2026) -
Can NLP Tackle Hate Speech in the Real World? Stakeholder-Informed Feedback and Survey on Counterspeech
by: Dinkar, Tanvi, et al.
Published: (2025) -
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
by: Bonaldi, Helena, et al.
Published: (2024) -
Multilingual Hate Speech Detection and Counterspeech Generation: A Comprehensive Survey and Practical Guide
by: Fesaghandis, Zahra Safdari, et al.
Published: (2026) -
Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection
by: Curry, Amanda Cercas, et al.
Published: (2024)