Saved in:
| Main Authors: | Liartis, Jason, Kaldeli, Eirini, Gyftokosta, Lambrini, Chelioudakis, Eleftherios, Mastromichalakis, Orfeas Menis |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.14970 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Don't Erase, Inform! Detecting and Contextualizing Harmful Language in Cultural Heritage Collections
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025)
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025)
Beyond One-Size-Fits-All: Adapting Counterfactual Explanations to User Objectives
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2024)
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2024)
The Grounding Gap: How LLMs Anchor the Meaning of Abstract Concepts Differently from Humans
by: Chlapanis, Odysseas S., et al.
Published: (2026)
by: Chlapanis, Odysseas S., et al.
Published: (2026)
Assumed Identities: Quantifying Gender Bias in Machine Translation of Gender-Ambiguous Occupational Terms
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025)
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025)
Semantic Prototypes: Enhancing Transparency Without Black Boxes
by: Menis-Mastromichalakis, Orfeas, et al.
Published: (2024)
by: Menis-Mastromichalakis, Orfeas, et al.
Published: (2024)
GAMBIT+: A Challenge Set for Evaluating Gender Bias in Machine Translation Quality Estimation Metrics
by: Filandrianos, Giorgos, et al.
Published: (2025)
by: Filandrianos, Giorgos, et al.
Published: (2025)
Deep Ensemble Art Style Recognition
by: Menis-Mastromichalakis, Orfeas, et al.
Published: (2024)
by: Menis-Mastromichalakis, Orfeas, et al.
Published: (2024)
AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking
by: Premptis, Iraklis, et al.
Published: (2025)
by: Premptis, Iraklis, et al.
Published: (2025)
GOSt-MT: A Knowledge Graph for Occupation-related Gender Biases in Machine Translation
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2024)
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2024)
MusicLIME: Explainable Multimodal Music Understanding
by: Sotirou, Theodoros, et al.
Published: (2024)
by: Sotirou, Theodoros, et al.
Published: (2024)
Self-Explaining Hate Speech Detection with Moral Rationales
by: Vargas, Francielle, et al.
Published: (2026)
by: Vargas, Francielle, et al.
Published: (2026)
Aligning Attention with Human Rationales for Self-Explaining Hate Speech Detection
by: Eilertsen, Brage, et al.
Published: (2025)
by: Eilertsen, Brage, et al.
Published: (2025)
Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation
by: Herrmann, Nils A., et al.
Published: (2026)
by: Herrmann, Nils A., et al.
Published: (2026)
Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language
by: Saffari, Hamidreza, et al.
Published: (2024)
by: Saffari, Hamidreza, et al.
Published: (2024)
HateDebias: On the Diversity and Variability of Hate Speech Debiasing
by: Wu, Hongyan, et al.
Published: (2024)
by: Wu, Hongyan, et al.
Published: (2024)
Decoding Hate: Exploring Language Models' Reactions to Hate Speech
by: Piot, Paloma, et al.
Published: (2024)
by: Piot, Paloma, et al.
Published: (2024)
"Is Hate Lost in Translation?": Evaluation of Multilingual LGBTQIA+ Hate Speech Detection
by: Chan, Fai Leui, et al.
Published: (2024)
by: Chan, Fai Leui, et al.
Published: (2024)
Web(er) of Hate: A Survey on How Hate Speech Is Typed
by: Wang, Luna, et al.
Published: (2025)
by: Wang, Luna, et al.
Published: (2025)
Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models
by: Yuan, Shuzhou, et al.
Published: (2025)
by: Yuan, Shuzhou, et al.
Published: (2025)
LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification
by: Yuan, Shuzhou, et al.
Published: (2025)
by: Yuan, Shuzhou, et al.
Published: (2025)
ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection
by: Mei, Jingbiao, et al.
Published: (2025)
by: Mei, Jingbiao, et al.
Published: (2025)
Advancing Hate Speech Detection with Transformers: Insights from the MetaHate
by: Chapagain, Santosh, et al.
Published: (2025)
by: Chapagain, Santosh, et al.
Published: (2025)
MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles
by: Ganguly, Amrita, et al.
Published: (2024)
by: Ganguly, Amrita, et al.
Published: (2024)
When Hate Meets Facts: LLMs-in-the-Loop for Check-worthiness Detection in Hate Speech
by: Ocampo, Nicolás Benjamín, et al.
Published: (2026)
by: Ocampo, Nicolás Benjamín, et al.
Published: (2026)
HateGPT: Unleashing GPT-3.5 Turbo to Combat Hate Speech on X
by: Deroy, Aniket, et al.
Published: (2024)
by: Deroy, Aniket, et al.
Published: (2024)
NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data
by: Tonneau, Manuel, et al.
Published: (2024)
by: Tonneau, Manuel, et al.
Published: (2024)
The Enforcement and Feasibility of Hate Speech Moderation on Twitter
by: Tonneau, Manuel, et al.
Published: (2026)
by: Tonneau, Manuel, et al.
Published: (2026)
Challenger at MultiPRIDE: Is It Hate Speech or Reclaimed?
by: Tekanlou, Hadi Bayrami Asl, et al.
Published: (2026)
by: Tekanlou, Hadi Bayrami Asl, et al.
Published: (2026)
Compositional Generalisation for Explainable Hate Speech Detection
by: Calabrese, Agostina, et al.
Published: (2025)
by: Calabrese, Agostina, et al.
Published: (2025)
Automatic Textual Normalization for Hate Speech Detection
by: Nguyen, Anh Thi-Hoang, et al.
Published: (2023)
by: Nguyen, Anh Thi-Hoang, et al.
Published: (2023)
MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection
by: Piot, Paloma, et al.
Published: (2024)
by: Piot, Paloma, et al.
Published: (2024)
GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?
by: Jin, Yiping, et al.
Published: (2024)
by: Jin, Yiping, et al.
Published: (2024)
HatePrototypes: Interpretable and Transferable Representations for Implicit and Explicit Hate Speech Detection
by: Proskurina, Irina, et al.
Published: (2025)
by: Proskurina, Irina, et al.
Published: (2025)
HateModerate: Testing Hate Speech Detectors against Content Moderation Policies
by: Zheng, Jiangrui, et al.
Published: (2023)
by: Zheng, Jiangrui, et al.
Published: (2023)
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
by: Muhammad, Shamsuddeen Hassan, et al.
Published: (2025)
by: Muhammad, Shamsuddeen Hassan, et al.
Published: (2025)
Dialogues of Dissent: Thematic and Rhetorical Dimensions of Hate and Counter-Hate Speech in Social Media Conversations
by: Levi, Effi, et al.
Published: (2025)
by: Levi, Effi, et al.
Published: (2025)
HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation
by: Rizwan, Naquee, et al.
Published: (2025)
by: Rizwan, Naquee, et al.
Published: (2025)
Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models
by: Bui, Minh Duc, et al.
Published: (2024)
by: Bui, Minh Duc, et al.
Published: (2024)
Evaluation of Hate Speech Detection Using Large Language Models and Geographical Contextualization
by: Zahid, Anwar Hossain, et al.
Published: (2025)
by: Zahid, Anwar Hossain, et al.
Published: (2025)
EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter
by: Ilevbare, Comfort Eseohen, et al.
Published: (2024)
by: Ilevbare, Comfort Eseohen, et al.
Published: (2024)
Similar Items
-
Don't Erase, Inform! Detecting and Contextualizing Harmful Language in Cultural Heritage Collections
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025) -
Beyond One-Size-Fits-All: Adapting Counterfactual Explanations to User Objectives
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2024) -
The Grounding Gap: How LLMs Anchor the Meaning of Abstract Concepts Differently from Humans
by: Chlapanis, Odysseas S., et al.
Published: (2026) -
Assumed Identities: Quantifying Gender Bias in Machine Translation of Gender-Ambiguous Occupational Terms
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025) -
Semantic Prototypes: Enhancing Transparency Without Black Boxes
by: Menis-Mastromichalakis, Orfeas, et al.
Published: (2024)