:: Library Catalog

Bibliographic Details
Main Authors:	Fillies, Jan; Hoffmann, Michael Peter; Reichel, Rebecca; Salzwedel, Roman; Bodemer, Sven; Paschke, Adrian
Format:	Preprint
Published:	2025
Subjects:	Computation and Language; Computers and Society
Online Access:	https://arxiv.org/abs/2508.21084

Similar Items

Malinowski in the Age of AI: Can large language models create a text game based on an anthropological classic?
by: Hoffmann, Michael Peter, et al.
Published: (2024)

Improving Hate Speech Classification with Cross-Taxonomy Dataset Integration
by: Fillies, Jan, et al.
Published: (2025)

ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation
by: Li, Peiran, et al.
Published: (2026)

A Hate Speech Moderated Chat Application: Use Case for GDPR and DSA Compliance
by: Fillies, Jan, et al.
Published: (2024)

Algospeak, Hiding in the Open: The Trade-off Between Legible Meaning and Detection Avoidance
by: Fillies, Jan, et al.
Published: (2026)

Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning
by: Hoffmann, Michael, et al.
Published: (2025)

PolInterviews -- A Dataset of German Politician Public Broadcast Interviews
by: Birkenmaier, Lukas, et al.
Published: (2025)

Benchmark on Peer Review Toxic Detection: A Challenging Task with a New Dataset
by: Luo, Man, et al.
Published: (2025)

Race and Privacy in Broadcast Police Communications
by: Venkit, Pranav Narayanan, et al.
Published: (2024)

Mapping Election Toxicity on Social Media across Issue, Ideology, and Psychosocial Dimensions
by: Cao, Lei, et al.
Published: (2026)

Towards Weakly-Supervised Hate Speech Classification Across Datasets
by: Jin, Yiping, et al.
Published: (2023)

Robustness and Confounders in the Demographic Alignment of LLMs with Human Perceptions of Offensiveness
by: Alipour, Shayan, et al.
Published: (2024)

Fine-Grained Named Entities for Corona News
by: Efeoglu, Sefika, et al.
Published: (2024)

Toxic comments reduce the activity of volunteer editors on Wikipedia
by: Smirnov, Ivan, et al.
Published: (2023)

Different Demographic Cues Yield Inconsistent Conclusions About LLM Personalization and Bias
by: Tonneau, Manuel, et al.
Published: (2026)

From Demographics to Survey Anchors: Evaluating LLM Agents for Modeling Retirement Attitudes
by: Garzón, Rubén, et al.
Published: (2026)

Investigating Political and Demographic Associations in Large Language Models Through Moral Foundations Theory
by: Smith-Vaniz, Nicole, et al.
Published: (2025)

ToXCL: A Unified Framework for Toxic Speech Detection and Explanation
by: Hoang, Nhat M., et al.
Published: (2024)

Mapping Violence: Developing an Extensive Framework to Build a Bangla Sectarian Expression Dataset from Social Media Interactions
by: Tasnim, Nazia, et al.
Published: (2024)

Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks
by: Efeoglu, Sefika, et al.
Published: (2024)

Retrieval-Augmented Generation-based Relation Extraction
by: Efeoglu, Sefika, et al.
Published: (2024)

Beyond Demographics: Enhancing Cultural Value Survey Simulation with Multi-Stage Personality-Driven Cognitive Reasoning
by: Liu, Haijiang, et al.
Published: (2025)

RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?
by: de Wynter, Adrian, et al.
Published: (2024)

GermanPartiesQA: Benchmarking Commercial Large Language Models and AI Companions for Political Alignment and Sycophancy
by: Batzner, Jan, et al.
Published: (2024)

The Monetisation of Toxicity: Analysing YouTube Content Creators and Controversy-Driven Engagement
by: Bertaglia, Thales, et al.
Published: (2024)

BTPD: A Multilingual Hand-curated Dataset of Bengali Transnational Political Discourse Across Online Communities
by: Das, Dipto, et al.
Published: (2025)

Sometimes the Model doth Preach: Quantifying Religious Bias in Open LLMs through Demographic Analysis in Asian Nations
by: Shankar, Hari, et al.
Published: (2025)

Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences
by: Hoffmann, Morgane, et al.
Published: (2026)

Down the Toxicity Rabbit Hole: A Novel Framework to Bias Audit Large Language Models
by: Dutta, Arka, et al.
Published: (2023)

Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
by: Iskander, Shadi, et al.
Published: (2024)

Extracting O*NET Features from the NLx Corpus to Build Public Use Aggregate Labor Market Data
by: Meisenbacher, Stephen, et al.
Published: (2025)

AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection
by: Pachinger, Pia, et al.
Published: (2024)

IndRegBias: A Dataset for Studying Indian Regional Biases in English and Code-Mixed Social Media Comments
by: Panda, Debasmita, et al.
Published: (2026)

Post-Training Language Models for Continual Relation Extraction
by: Efeoglu, Sefika, et al.
Published: (2025)

RoMathExam: A Longitudinal Dataset of Romanian Math Exams (1895-2025) with a Seven-Decade Core (1957-2025)
by: Cuclea, Luca-Nicolae, et al.
Published: (2026)

Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study
by: Majumdar, Ayan, et al.
Published: (2025)

The Unequal Opportunities of Large Language Models: Revealing Demographic Bias through Job Recommendations
by: Salinas, Abel, et al.
Published: (2023)

Exploiting User Comments for Early Detection of Fake News Prior to Users' Commenting
by: Nan, Qiong, et al.
Published: (2023)

Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds
by: Eichel, Annerose, et al.
Published: (2024)

On the Reliability of Large Language Models to Misinformed and Demographically-Informed Prompts
by: Aremu, Toluwani, et al.
Published: (2024)