Bibliographic Details
Main Authors:	Hong, Lingzi; Luo, Pengcheng; Blanco, Eduardo; Song, Xiaoying
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2403.17146

Similar Items

Assessing the Human Likeness of AI-Generated Counterspeech
by: Song, Xiaoying, et al.
Published: (2024)

Echoes of Discord: Forecasting Hater Reactions to Counterspeech
by: Song, Xiaoying, et al.
Published: (2025)

A Dynamic Fusion Model for Consistent Crisis Response
by: Song, Xiaoying, et al.
Published: (2025)

Speaking at the Right Level: Literacy-Controlled Counterspeech Generation with RAG-RL
by: Song, Xiaoying, et al.
Published: (2025)

A Hybrid Framework for Subject Analysis: Integrating Embedding-Based Regression Models with Large Language Models
by: Liu, Jinyu, et al.
Published: (2025)

Dialogues of Dissent: Thematic and Rhetorical Dimensions of Hate and Counter-Hate Speech in Social Media Conversations
by: Levi, Effi, et al.
Published: (2025)

Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models
by: Yuan, Shuzhou, et al.
Published: (2025)

Consolidating Strategies for Countering Hate Speech Using Persuasive Dialogues
by: Saha, Sougata, et al.
Published: (2024)

Distilling Knowledge from Large Language Models: A Concept Bottleneck Model for Hate and Counter Speech Recognition
by: Labadie-Tamayo, Roberto, et al.
Published: (2025)

Exploring the Plausibility of Hate and Counter Speech Detectors with Explainable AI
by: Böck, Adrian Jaques, et al.
Published: (2024)

Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
by: Bonaldi, Helena, et al.
Published: (2024)

PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions
by: Damo, Greta, et al.
Published: (2026)

Decoding Hate: Exploring Language Models' Reactions to Hate Speech
by: Piot, Paloma, et al.
Published: (2024)

HateTinyLLM: Hate Speech Detection Using Tiny Large Language Models
by: Sen, Tanmay, et al.
Published: (2024)

ReZG: Retrieval-Augmented Zero-Shot Counter Narrative Generation for Hate Speech
by: Jiang, Shuyu, et al.
Published: (2023)

Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
by: Anik, Anirban Saha, et al.
Published: (2025)

Exploring Large Language Models for Hate Speech Detection in Rioplatense Spanish
by: Pérez, Juan Manuel, et al.
Published: (2024)

Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement
by: Nge, Brian Jing Hong, et al.
Published: (2026)

Hatred Stems from Ignorance! Distillation of the Persuasion Modes in Countering Conversational Hate Speech
by: Alyahya, Ghadi, et al.
Published: (2024)

Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models
by: Bui, Minh Duc, et al.
Published: (2024)

Detecting Anti-Semitic Hate Speech using Transformer-based Large Language Models
by: Liu, Dengyi, et al.
Published: (2024)

Analyzing Bias in False Refusal Behavior of Large Language Models for Hate Speech Detoxification
by: Im, Kyuri, et al.
Published: (2026)

Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech
by: Ludwig, Florian, et al.
Published: (2025)

Investigating Annotator Bias in Large Language Models for Hate Speech Detection
by: Das, Amit, et al.
Published: (2024)

Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection
by: Kumarage, Tharindu, et al.
Published: (2024)

AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
by: Muhammad, Shamsuddeen Hassan, et al.
Published: (2025)

An Investigation of Large Language Models for Real-World Hate Speech Detection
by: Guo, Keyan, et al.
Published: (2024)

Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models
by: Hee, Ming Shan, et al.
Published: (2024)

HateDebias: On the Diversity and Variability of Hate Speech Debiasing
by: Wu, Hongyan, et al.
Published: (2024)

COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport
by: Zhang, Linhao, et al.
Published: (2024)

Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales
by: Nirmal, Ayushi, et al.
Published: (2024)

EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter
by: Ilevbare, Comfort Eseohen, et al.
Published: (2024)

Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models
by: Piot, Paloma, et al.
Published: (2025)

"Is Hate Lost in Translation?": Evaluation of Multilingual LGBTQIA+ Hate Speech Detection
by: Chan, Fai Leui, et al.
Published: (2024)

Web(er) of Hate: A Survey on How Hate Speech Is Typed
by: Wang, Luna, et al.
Published: (2025)

Translate-and-Revise: Boosting Large Language Models for Constrained Translation
by: Huang, Pengcheng, et al.
Published: (2024)

LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification
by: Yuan, Shuzhou, et al.
Published: (2025)

Large Language Models and Thematic Analysis: Human-AI Synergy in Researching Hate Speech on Social Media
by: Breazu, Petre, et al.
Published: (2024)

Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language
by: Saffari, Hamidreza, et al.
Published: (2024)

Bangla Hate Speech Classification with Fine-tuned Transformer Models
by: Jafari, Yalda Keivan, et al.
Published: (2025)