Saved in:
| Main Authors: | Hong, Lingzi, Luo, Pengcheng, Blanco, Eduardo, Song, Xiaoying |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.17146 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Assessing the Human Likeness of AI-Generated Counterspeech
by: Song, Xiaoying, et al.
Published: (2024)
by: Song, Xiaoying, et al.
Published: (2024)
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
by: Song, Xiaoying, et al.
Published: (2025)
by: Song, Xiaoying, et al.
Published: (2025)
A Dynamic Fusion Model for Consistent Crisis Response
by: Song, Xiaoying, et al.
Published: (2025)
by: Song, Xiaoying, et al.
Published: (2025)
Speaking at the Right Level: Literacy-Controlled Counterspeech Generation with RAG-RL
by: Song, Xiaoying, et al.
Published: (2025)
by: Song, Xiaoying, et al.
Published: (2025)
A Hybrid Framework for Subject Analysis: Integrating Embedding-Based Regression Models with Large Language Models
by: Liu, Jinyu, et al.
Published: (2025)
by: Liu, Jinyu, et al.
Published: (2025)
Dialogues of Dissent: Thematic and Rhetorical Dimensions of Hate and Counter-Hate Speech in Social Media Conversations
by: Levi, Effi, et al.
Published: (2025)
by: Levi, Effi, et al.
Published: (2025)
Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models
by: Yuan, Shuzhou, et al.
Published: (2025)
by: Yuan, Shuzhou, et al.
Published: (2025)
Consolidating Strategies for Countering Hate Speech Using Persuasive Dialogues
by: Saha, Sougata, et al.
Published: (2024)
by: Saha, Sougata, et al.
Published: (2024)
Distilling Knowledge from Large Language Models: A Concept Bottleneck Model for Hate and Counter Speech Recognition
by: Labadie-Tamayo, Roberto, et al.
Published: (2025)
by: Labadie-Tamayo, Roberto, et al.
Published: (2025)
Exploring the Plausibility of Hate and Counter Speech Detectors with Explainable AI
by: Böck, Adrian Jaques, et al.
Published: (2024)
by: Böck, Adrian Jaques, et al.
Published: (2024)
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
by: Bonaldi, Helena, et al.
Published: (2024)
by: Bonaldi, Helena, et al.
Published: (2024)
PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions
by: Damo, Greta, et al.
Published: (2026)
by: Damo, Greta, et al.
Published: (2026)
Decoding Hate: Exploring Language Models' Reactions to Hate Speech
by: Piot, Paloma, et al.
Published: (2024)
by: Piot, Paloma, et al.
Published: (2024)
HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models
by: Sen, Tanmay, et al.
Published: (2024)
by: Sen, Tanmay, et al.
Published: (2024)
ReZG: Retrieval-Augmented Zero-Shot Counter Narrative Generation for Hate Speech
by: Jiang, Shuyu, et al.
Published: (2023)
by: Jiang, Shuyu, et al.
Published: (2023)
Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
by: Anik, Anirban Saha, et al.
Published: (2025)
by: Anik, Anirban Saha, et al.
Published: (2025)
Exploring Large Language Models for Hate Speech Detection in Rioplatense Spanish
by: Pérez, Juan Manuel, et al.
Published: (2024)
by: Pérez, Juan Manuel, et al.
Published: (2024)
Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement
by: Nge, Brian Jing Hong, et al.
Published: (2026)
by: Nge, Brian Jing Hong, et al.
Published: (2026)
Hatred Stems from Ignorance! Distillation of the Persuasion Modes in Countering Conversational Hate Speech
by: Alyahya, Ghadi, et al.
Published: (2024)
by: Alyahya, Ghadi, et al.
Published: (2024)
Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models
by: Bui, Minh Duc, et al.
Published: (2024)
by: Bui, Minh Duc, et al.
Published: (2024)
Detecting Anti-Semitic Hate Speech using Transformer-based Large Language Models
by: Liu, Dengyi, et al.
Published: (2024)
by: Liu, Dengyi, et al.
Published: (2024)
Analyzing Bias in False Refusal Behavior of Large Language Models for Hate Speech Detoxification
by: Im, Kyuri, et al.
Published: (2026)
by: Im, Kyuri, et al.
Published: (2026)
Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech
by: Ludwig, Florian, et al.
Published: (2025)
by: Ludwig, Florian, et al.
Published: (2025)
Investigating Annotator Bias in Large Language Models for Hate Speech Detection
by: Das, Amit, et al.
Published: (2024)
by: Das, Amit, et al.
Published: (2024)
Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection
by: Kumarage, Tharindu, et al.
Published: (2024)
by: Kumarage, Tharindu, et al.
Published: (2024)
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
by: Muhammad, Shamsuddeen Hassan, et al.
Published: (2025)
by: Muhammad, Shamsuddeen Hassan, et al.
Published: (2025)
An Investigation of Large Language Models for Real-World Hate Speech Detection
by: Guo, Keyan, et al.
Published: (2024)
by: Guo, Keyan, et al.
Published: (2024)
Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models
by: Hee, Ming Shan, et al.
Published: (2024)
by: Hee, Ming Shan, et al.
Published: (2024)
HateDebias: On the Diversity and Variability of Hate Speech Debiasing
by: Wu, Hongyan, et al.
Published: (2024)
by: Wu, Hongyan, et al.
Published: (2024)
COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport
by: Zhang, Linhao, et al.
Published: (2024)
by: Zhang, Linhao, et al.
Published: (2024)
Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales
by: Nirmal, Ayushi, et al.
Published: (2024)
by: Nirmal, Ayushi, et al.
Published: (2024)
EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter
by: Ilevbare, Comfort Eseohen, et al.
Published: (2024)
by: Ilevbare, Comfort Eseohen, et al.
Published: (2024)
Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models
by: Piot, Paloma, et al.
Published: (2025)
by: Piot, Paloma, et al.
Published: (2025)
"Is Hate Lost in Translation?": Evaluation of Multilingual LGBTQIA+ Hate Speech Detection
by: Chan, Fai Leui, et al.
Published: (2024)
by: Chan, Fai Leui, et al.
Published: (2024)
Web(er) of Hate: A Survey on How Hate Speech Is Typed
by: Wang, Luna, et al.
Published: (2025)
by: Wang, Luna, et al.
Published: (2025)
Translate-and-Revise: Boosting Large Language Models for Constrained Translation
by: Huang, Pengcheng, et al.
Published: (2024)
by: Huang, Pengcheng, et al.
Published: (2024)
LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification
by: Yuan, Shuzhou, et al.
Published: (2025)
by: Yuan, Shuzhou, et al.
Published: (2025)
Large Language Models and Thematic Analysis: Human-AI Synergy in Researching Hate Speech on Social Media
by: Breazu, Petre, et al.
Published: (2024)
by: Breazu, Petre, et al.
Published: (2024)
Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language
by: Saffari, Hamidreza, et al.
Published: (2024)
by: Saffari, Hamidreza, et al.
Published: (2024)
Bangla Hate Speech Classification with Fine-tuned Transformer Models
by: Jafari, Yalda Keivan, et al.
Published: (2025)
by: Jafari, Yalda Keivan, et al.
Published: (2025)
Similar Items
-
Assessing the Human Likeness of AI-Generated Counterspeech
by: Song, Xiaoying, et al.
Published: (2024) -
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
by: Song, Xiaoying, et al.
Published: (2025) -
A Dynamic Fusion Model for Consistent Crisis Response
by: Song, Xiaoying, et al.
Published: (2025) -
Speaking at the Right Level: Literacy-Controlled Counterspeech Generation with RAG-RL
by: Song, Xiaoying, et al.
Published: (2025) -
A Hybrid Framework for Subject Analysis: Integrating Embedding-Based Regression Models with Large Language Models
by: Liu, Jinyu, et al.
Published: (2025)