| Main Authors: | Siddique, Zara, Turner, Liam D., Espinosa-Anke, Luis |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.06917 |
Similar Items
Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
by: Siddique, Zara, et al.
Published: (2025)
Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering Prompts
by: Nakanishi, Akito, et al.
Published: (2025)
Addressing Stereotypes in Large Language Models: A Critical Examination and Mitigation
by: Kazi, Fatima
Published: (2025)
A Taxonomy of Stereotype Content in Large Language Models
by: Nicolas, Gandalf, et al.
Published: (2024)
Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
by: Plaza-del-Arco, Flor Miriam, et al.
Published: (2024)
DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis
by: Vijayaraghavan, Prashanth, et al.
Published: (2025)
Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency
by: Liu, Yiran, et al.
Published: (2024)
A Survey on Stereotype Detection in Natural Language Processing
by: Cignarella, Alessandra Teresa, et al.
Published: (2025)
An Empirical Investigation of Gender Stereotype Representation in Large Language Models: The Italian Case
by: Giachino, Gioele, et al.
Published: (2025)
Self-Debiasing Large Language Models: Zero-Shot Recognition and Reduction of Stereotypes
by: Gallegos, Isabel O., et al.
Published: (2024)
Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes
by: Nejadgholi, Isar, et al.
Published: (2024)
Dialz: A Python Toolkit for Steering Vectors
by: Siddique, Zara, et al.
Published: (2025)
Large Language Models as Students Who Think Aloud: Overly Coherent, Verbose, and Confident
by: Borchers, Conrad, et al.
Published: (2026)
Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation
by: Boisson, Joanne, et al.
Published: (2024)
Local Contrastive Editing of Gender Stereotypes
by: Lutz, Marlene, et al.
Published: (2024)
BiasEdit: Debiasing Stereotyped Language Models via Model Editing
by: Xu, Xin, et al.
Published: (2025)
From General Reasoning to Domain Expertise: Uncovering the Limits of Generalization in Large Language Models
by: Alsagheer, Dana, et al.
Published: (2025)
SESGO: Spanish Evaluation of Stereotypical Generative Outputs
by: Robles, Melissa, et al.
Published: (2025)
On The Role of Reasoning in the Identification of Subtle Stereotypes in Natural Language
by: Tian, Jacob-Junqi, et al.
Published: (2023)
Who Shares Fake News? Uncovering Insights from Social Media Users' Post Histories
by: Schoenmueller, Verena, et al.
Published: (2022)
Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs
by: Saeed, Muhammed, et al.
Published: (2025)
Probing the Subtle Ideological Manipulation of Large Language Models
by: Paschalides, Demetris, et al.
Published: (2025)
Motivation in Large Language Models
by: Nahum, Omer, et al.
Published: (2026)
Reinforcing Stereotypes of Anger: Emotion AI on African American Vernacular English
by: Dorn, Rebecca, et al.
Published: (2025)
StereoTales: A Multilingual Framework for Open-Ended Stereotype Discovery in LLMs
by: Jeune, Pierre Le, et al.
Published: (2026)
Language of Thought Shapes Output Diversity in Large Language Models
by: Xu, Shaoyang, et al.
Published: (2026)
Leveraging Large Language Models for Actionable Course Evaluation Student Feedback to Lecturers
by: Zhang, Mike, et al.
Published: (2024)
Investigating Cultural Alignment of Large Language Models
by: AlKhamissi, Badr, et al.
Published: (2024)
Multilingual Large Language Models and Curse of Multilinguality
by: Gurgurov, Daniil, et al.
Published: (2024)
On Classification with Large Language Models in Cultural Analytics
by: Bamman, David, et al.
Published: (2024)
Will Large Language Models Transform Clinical Prediction?
by: Yildiz, Yusuf, et al.
Published: (2025)
Large Language Models in the Abuse Detection Pipeline
by: Kath, Suraj, et al.
Published: (2026)
Climate Change from Large Language Models
by: Zhu, Hongyin, et al.
Published: (2023)
Urban Computing in the Era of Large Language Models
by: Li, Zhonghang, et al.
Published: (2025)
The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data
by: Luca, Massimiliano, et al.
Published: (2025)
Development of Application-Specific Large Language Models to Facilitate Research Ethics Review
by: Mann, Sebastian Porsdam, et al.
Published: (2025)
Uncovering Regulatory Affairs Complexity in Medical Products: A Qualitative Assessment Utilizing Open Coding and Natural Language Processing (NLP)
by: Han, Yu, et al.
Published: (2023)
MALIBU Benchmark: Multi-Agent LLM Implicit Bias Uncovered
by: Mirza, Imran, et al.
Published: (2025)
StereoDetect: Detecting Stereotypes and Anti-stereotypes the Correct Way Using Social Psychological Underpinnings
by: Shejole, Kaustubh Shivshankar, et al.
Published: (2025)
The World of Generative AI: Deepfakes and Large Language Models
by: Mitra, Alakananda, et al.
Published: (2024)