Saved in:
| Main Authors: | Fillies, Jan, Hoffmann, Michael Peter, Reichel, Rebecca, Salzwedel, Roman, Bodemer, Sven, Paschke, Adrian |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.21084 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Malinowski in the Age of AI: Can large language models create a text game based on an anthropological classic?
by: Hoffmann, Michael Peter, et al.
Published: (2024)
by: Hoffmann, Michael Peter, et al.
Published: (2024)
Improving Hate Speech Classification with Cross-Taxonomy Dataset Integration
by: Fillies, Jan, et al.
Published: (2025)
by: Fillies, Jan, et al.
Published: (2025)
ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation
by: Li, Peiran, et al.
Published: (2026)
by: Li, Peiran, et al.
Published: (2026)
A Hate Speech Moderated Chat Application: Use Case for GDPR and DSA Compliance
by: Fillies, Jan, et al.
Published: (2024)
by: Fillies, Jan, et al.
Published: (2024)
Algospeak, Hiding in the Open: The Trade-off Between Legible Meaning and Detection Avoidance
by: Fillies, Jan, et al.
Published: (2026)
by: Fillies, Jan, et al.
Published: (2026)
Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning
by: Hoffmann, Michael, et al.
Published: (2025)
by: Hoffmann, Michael, et al.
Published: (2025)
PolInterviews -- A Dataset of German Politician Public Broadcast Interviews
by: Birkenmaier, Lukas, et al.
Published: (2025)
by: Birkenmaier, Lukas, et al.
Published: (2025)
Benchmark on Peer Review Toxic Detection: A Challenging Task with a New Dataset
by: Luo, Man, et al.
Published: (2025)
by: Luo, Man, et al.
Published: (2025)
Race and Privacy in Broadcast Police Communications
by: Venkit, Pranav Narayanan, et al.
Published: (2024)
by: Venkit, Pranav Narayanan, et al.
Published: (2024)
Mapping Election Toxicity on Social Media across Issue, Ideology, and Psychosocial Dimensions
by: Cao, Lei, et al.
Published: (2026)
by: Cao, Lei, et al.
Published: (2026)
Towards Weakly-Supervised Hate Speech Classification Across Datasets
by: Jin, Yiping, et al.
Published: (2023)
by: Jin, Yiping, et al.
Published: (2023)
Robustness and Confounders in the Demographic Alignment of LLMs with Human Perceptions of Offensiveness
by: Alipour, Shayan, et al.
Published: (2024)
by: Alipour, Shayan, et al.
Published: (2024)
Fine-Grained Named Entities for Corona News
by: Efeoglu, Sefika, et al.
Published: (2024)
by: Efeoglu, Sefika, et al.
Published: (2024)
Toxic comments reduce the activity of volunteer editors on Wikipedia
by: Smirnov, Ivan, et al.
Published: (2023)
by: Smirnov, Ivan, et al.
Published: (2023)
Different Demographic Cues Yield Inconsistent Conclusions About LLM Personalization and Bias
by: Tonneau, Manuel, et al.
Published: (2026)
by: Tonneau, Manuel, et al.
Published: (2026)
From Demographics to Survey Anchors: Evaluating LLM Agents for Modeling Retirement Attitudes
by: Garzón, Rubén, et al.
Published: (2026)
by: Garzón, Rubén, et al.
Published: (2026)
Investigating Political and Demographic Associations in Large Language Models Through Moral Foundations Theory
by: Smith-Vaniz, Nicole, et al.
Published: (2025)
by: Smith-Vaniz, Nicole, et al.
Published: (2025)
ToXCL: A Unified Framework for Toxic Speech Detection and Explanation
by: Hoang, Nhat M., et al.
Published: (2024)
by: Hoang, Nhat M., et al.
Published: (2024)
Mapping Violence: Developing an Extensive Framework to Build a Bangla Sectarian Expression Dataset from Social Media Interactions
by: Tasnim, Nazia, et al.
Published: (2024)
by: Tasnim, Nazia, et al.
Published: (2024)
Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks
by: Efeoglu, Sefika, et al.
Published: (2024)
by: Efeoglu, Sefika, et al.
Published: (2024)
Retrieval-Augmented Generation-based Relation Extraction
by: Efeoglu, Sefika, et al.
Published: (2024)
by: Efeoglu, Sefika, et al.
Published: (2024)
Beyond Demographics: Enhancing Cultural Value Survey Simulation with Multi-Stage Personality-Driven Cognitive Reasoning
by: Liu, Haijiang, et al.
Published: (2025)
by: Liu, Haijiang, et al.
Published: (2025)
RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?
by: de Wynter, Adrian, et al.
Published: (2024)
by: de Wynter, Adrian, et al.
Published: (2024)
GermanPartiesQA: Benchmarking Commercial Large Language Models and AI Companions for Political Alignment and Sycophancy
by: Batzner, Jan, et al.
Published: (2024)
by: Batzner, Jan, et al.
Published: (2024)
The Monetisation of Toxicity: Analysing YouTube Content Creators and Controversy-Driven Engagement
by: Bertaglia, Thales, et al.
Published: (2024)
by: Bertaglia, Thales, et al.
Published: (2024)
BTPD: A Multilingual Hand-curated Dataset of Bengali Transnational Political Discourse Across Online Communities
by: Das, Dipto, et al.
Published: (2025)
by: Das, Dipto, et al.
Published: (2025)
Sometimes the Model doth Preach: Quantifying Religious Bias in Open LLMs through Demographic Analysis in Asian Nations
by: Shankar, Hari, et al.
Published: (2025)
by: Shankar, Hari, et al.
Published: (2025)
Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences
by: Hoffmann, Morgane, et al.
Published: (2026)
by: Hoffmann, Morgane, et al.
Published: (2026)
Down the Toxicity Rabbit Hole: A Novel Framework to Bias Audit Large Language Models
by: Dutta, Arka, et al.
Published: (2023)
by: Dutta, Arka, et al.
Published: (2023)
Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
by: Iskander, Shadi, et al.
Published: (2024)
by: Iskander, Shadi, et al.
Published: (2024)
Extracting O*NET Features from the NLx Corpus to Build Public Use Aggregate Labor Market Data
by: Meisenbacher, Stephen, et al.
Published: (2025)
by: Meisenbacher, Stephen, et al.
Published: (2025)
AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection
by: Pachinger, Pia, et al.
Published: (2024)
by: Pachinger, Pia, et al.
Published: (2024)
IndRegBias: A Dataset for Studying Indian Regional Biases in English and Code-Mixed Social Media Comments
by: Panda, Debasmita, et al.
Published: (2026)
by: Panda, Debasmita, et al.
Published: (2026)
Post-Training Language Models for Continual Relation Extraction
by: Efeoglu, Sefika, et al.
Published: (2025)
by: Efeoglu, Sefika, et al.
Published: (2025)
RoMathExam: A Longitudinal Dataset of Romanian Math Exams (1895-2025) with a Seven-Decade Core (1957-2025)
by: Cuclea, Luca-Ncolae, et al.
Published: (2026)
by: Cuclea, Luca-Ncolae, et al.
Published: (2026)
Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study
by: Majumdar, Ayan, et al.
Published: (2025)
by: Majumdar, Ayan, et al.
Published: (2025)
The Unequal Opportunities of Large Language Models: Revealing Demographic Bias through Job Recommendations
by: Salinas, Abel, et al.
Published: (2023)
by: Salinas, Abel, et al.
Published: (2023)
Exploiting User Comments for Early Detection of Fake News Prior to Users' Commenting
by: Nan, Qiong, et al.
Published: (2023)
by: Nan, Qiong, et al.
Published: (2023)
Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds
by: Eichel, Annerose, et al.
Published: (2024)
by: Eichel, Annerose, et al.
Published: (2024)
On the Reliability of Large Language Models to Misinformed and Demographically-Informed Prompts
by: Aremu, Toluwani, et al.
Published: (2024)
by: Aremu, Toluwani, et al.
Published: (2024)
Similar Items
-
Malinowski in the Age of AI: Can large language models create a text game based on an anthropological classic?
by: Hoffmann, Michael Peter, et al.
Published: (2024) -
Improving Hate Speech Classification with Cross-Taxonomy Dataset Integration
by: Fillies, Jan, et al.
Published: (2025) -
ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation
by: Li, Peiran, et al.
Published: (2026) -
A Hate Speech Moderated Chat Application: Use Case for GDPR and DSA Compliance
by: Fillies, Jan, et al.
Published: (2024) -
Algospeak, Hiding in the Open: The Trade-off Between Legible Meaning and Detection Avoidance
by: Fillies, Jan, et al.
Published: (2026)