:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liartis, Jason, Kaldeli, Eirini, Gyftokosta, Lambrini, Chelioudakis, Eleftherios, Mastromichalakis, Orfeas Menis
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2604.14970
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Don't Erase, Inform! Detecting and Contextualizing Harmful Language in Cultural Heritage Collections
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025)

Beyond One-Size-Fits-All: Adapting Counterfactual Explanations to User Objectives
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2024)

The Grounding Gap: How LLMs Anchor the Meaning of Abstract Concepts Differently from Humans
by: Chlapanis, Odysseas S., et al.
Published: (2026)

Assumed Identities: Quantifying Gender Bias in Machine Translation of Gender-Ambiguous Occupational Terms
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025)

Semantic Prototypes: Enhancing Transparency Without Black Boxes
by: Menis-Mastromichalakis, Orfeas, et al.
Published: (2024)

GAMBIT+: A Challenge Set for Evaluating Gender Bias in Machine Translation Quality Estimation Metrics
by: Filandrianos, Giorgos, et al.
Published: (2025)

Deep Ensemble Art Style Recognition
by: Menis-Mastromichalakis, Orfeas, et al.
Published: (2024)

AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking
by: Premptis, Iraklis, et al.
Published: (2025)

GOSt-MT: A Knowledge Graph for Occupation-related Gender Biases in Machine Translation
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2024)

MusicLIME: Explainable Multimodal Music Understanding
by: Sotirou, Theodoros, et al.
Published: (2024)

Self-Explaining Hate Speech Detection with Moral Rationales
by: Vargas, Francielle, et al.
Published: (2026)

Aligning Attention with Human Rationales for Self-Explaining Hate Speech Detection
by: Eilertsen, Brage, et al.
Published: (2025)

Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation
by: Herrmann, Nils A., et al.
Published: (2026)

Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language
by: Saffari, Hamidreza, et al.
Published: (2024)

HateDebias: On the Diversity and Variability of Hate Speech Debiasing
by: Wu, Hongyan, et al.
Published: (2024)

Decoding Hate: Exploring Language Models' Reactions to Hate Speech
by: Piot, Paloma, et al.
Published: (2024)

"Is Hate Lost in Translation?": Evaluation of Multilingual LGBTQIA+ Hate Speech Detection
by: Chan, Fai Leui, et al.
Published: (2024)

Web(er) of Hate: A Survey on How Hate Speech Is Typed
by: Wang, Luna, et al.
Published: (2025)

Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models
by: Yuan, Shuzhou, et al.
Published: (2025)

LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification
by: Yuan, Shuzhou, et al.
Published: (2025)

ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection
by: Mei, Jingbiao, et al.
Published: (2025)

Advancing Hate Speech Detection with Transformers: Insights from the MetaHate
by: Chapagain, Santosh, et al.
Published: (2025)

MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles
by: Ganguly, Amrita, et al.
Published: (2024)

When Hate Meets Facts: LLMs-in-the-Loop for Check-worthiness Detection in Hate Speech
by: Ocampo, Nicolás Benjamín, et al.
Published: (2026)

HateGPT: Unleashing GPT-3.5 Turbo to Combat Hate Speech on X
by: Deroy, Aniket, et al.
Published: (2024)

NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data
by: Tonneau, Manuel, et al.
Published: (2024)

The Enforcement and Feasibility of Hate Speech Moderation on Twitter
by: Tonneau, Manuel, et al.
Published: (2026)

Challenger at MultiPRIDE: Is It Hate Speech or Reclaimed?
by: Tekanlou, Hadi Bayrami Asl, et al.
Published: (2026)

Compositional Generalisation for Explainable Hate Speech Detection
by: Calabrese, Agostina, et al.
Published: (2025)

Automatic Textual Normalization for Hate Speech Detection
by: Nguyen, Anh Thi-Hoang, et al.
Published: (2023)

MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection
by: Piot, Paloma, et al.
Published: (2024)

GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?
by: Jin, Yiping, et al.
Published: (2024)

HatePrototypes: Interpretable and Transferable Representations for Implicit and Explicit Hate Speech Detection
by: Proskurina, Irina, et al.
Published: (2025)

HateModerate: Testing Hate Speech Detectors against Content Moderation Policies
by: Zheng, Jiangrui, et al.
Published: (2023)

AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
by: Muhammad, Shamsuddeen Hassan, et al.
Published: (2025)

Dialogues of Dissent: Thematic and Rhetorical Dimensions of Hate and Counter-Hate Speech in Social Media Conversations
by: Levi, Effi, et al.
Published: (2025)

HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation
by: Rizwan, Naquee, et al.
Published: (2025)

Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models
by: Bui, Minh Duc, et al.
Published: (2024)

Evaluation of Hate Speech Detection Using Large Language Models and Geographical Contextualization
by: Zahid, Anwar Hossain, et al.
Published: (2025)

EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter
by: Ilevbare, Comfort Eseohen, et al.
Published: (2024)