Saved in:
| Main Authors: | Pang, Bo, Qiao, Tingrui, Walker, Caroline, Cunningham, Chris, Koh, Yun Sing |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.01679 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Fair Representation in Parliamentary Summaries: Measuring and Mitigating Inclusion Bias
by: Cunningham, Eoghan, et al.
Published: (2025)
by: Cunningham, Eoghan, et al.
Published: (2025)
MetaWild: A Multimodal Dataset for Animal Re-Identification with Environmental Metadata
by: Li, Yuzhuo, et al.
Published: (2025)
by: Li, Yuzhuo, et al.
Published: (2025)
BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses
by: Xu, Xin, et al.
Published: (2025)
by: Xu, Xin, et al.
Published: (2025)
Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
by: Kumar, Abhishek, et al.
Published: (2024)
by: Kumar, Abhishek, et al.
Published: (2024)
Bias and Fairness in Large Language Models: A Survey
by: Gallegos, Isabel O., et al.
Published: (2023)
by: Gallegos, Isabel O., et al.
Published: (2023)
Intrinsic Meets Extrinsic Fairness: Assessing the Downstream Impact of Bias Mitigation in Large Language Models
by: Arzaghi', 'Mina, et al.
Published: (2025)
by: Arzaghi', 'Mina, et al.
Published: (2025)
What's in a Name? Auditing Large Language Models for Race and Gender Bias
by: Salinas, Alejandro, et al.
Published: (2024)
by: Salinas, Alejandro, et al.
Published: (2024)
Contextual StereoSet: Stress-Testing Bias Alignment Robustness in Large Language Models
by: Basu, Abhinaba, et al.
Published: (2026)
by: Basu, Abhinaba, et al.
Published: (2026)
ELMES: An Automated Framework for Evaluating Large Language Models in Educational Scenarios
by: Wei, Shou'ang, et al.
Published: (2025)
by: Wei, Shou'ang, et al.
Published: (2025)
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases
by: Bouchard, Dylan, et al.
Published: (2025)
by: Bouchard, Dylan, et al.
Published: (2025)
Towards Modeling Learner Performance with Large Language Models
by: Neshaei, Seyed Parsa, et al.
Published: (2024)
by: Neshaei, Seyed Parsa, et al.
Published: (2024)
BiasEdit: Debiasing Stereotyped Language Models via Model Editing
by: Xu, Xin, et al.
Published: (2025)
by: Xu, Xin, et al.
Published: (2025)
Exploring Accuracy-Fairness Trade-off in Large Language Models
by: Zhang, Qingquan, et al.
Published: (2024)
by: Zhang, Qingquan, et al.
Published: (2024)
Generalization in Healthcare AI: Evaluation of a Clinical Large Language Model
by: Rahman, Salman, et al.
Published: (2024)
by: Rahman, Salman, et al.
Published: (2024)
A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models
by: Pfohl, Stephen R., et al.
Published: (2024)
by: Pfohl, Stephen R., et al.
Published: (2024)
Protected group bias and stereotypes in Large Language Models
by: Kotek, Hadas, et al.
Published: (2024)
by: Kotek, Hadas, et al.
Published: (2024)
Understanding Intrinsic Socioeconomic Biases in Large Language Models
by: Arzaghi, Mina, et al.
Published: (2024)
by: Arzaghi, Mina, et al.
Published: (2024)
Limits to Predicting Online Speech Using Large Language Models
by: Remeli, Mina, et al.
Published: (2024)
by: Remeli, Mina, et al.
Published: (2024)
Harnessing Large Language Models for Disaster Management: A Survey
by: Lei, Zhenyu, et al.
Published: (2025)
by: Lei, Zhenyu, et al.
Published: (2025)
Revealing Fine-Grained Values and Opinions in Large Language Models
by: Wright, Dustin, et al.
Published: (2024)
by: Wright, Dustin, et al.
Published: (2024)
Mitigating Bias for Question Answering Models by Tracking Bias Influence
by: Ma, Mingyu Derek, et al.
Published: (2023)
by: Ma, Mingyu Derek, et al.
Published: (2023)
Addressing Stereotypes in Large Language Models: A Critical Examination and Mitigation
by: Kazi, Fatima
Published: (2025)
by: Kazi, Fatima
Published: (2025)
Exploring the Potential of the Large Language Models (LLMs) in Identifying Misleading News Headlines
by: Rony, Md Main Uddin, et al.
Published: (2024)
by: Rony, Md Main Uddin, et al.
Published: (2024)
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming
by: Tedeschi, Simone, et al.
Published: (2024)
by: Tedeschi, Simone, et al.
Published: (2024)
Communication Bias in Large Language Models: A Regulatory Perspective
by: Kuenzler, Adrian, et al.
Published: (2025)
by: Kuenzler, Adrian, et al.
Published: (2025)
Do Large Language Models Walk Their Talk? Measuring the Gap Between Implicit Associations, Self-Report, and Behavioral Altruism
by: Andric, Sandro
Published: (2025)
by: Andric, Sandro
Published: (2025)
Pro-AI Bias in Large Language Models
by: Trabelsi, Benaya, et al.
Published: (2026)
by: Trabelsi, Benaya, et al.
Published: (2026)
Cultural Alignment in Large Language Models: An Explanatory Analysis Based on Hofstede's Cultural Dimensions
by: Masoud, Reem I., et al.
Published: (2023)
by: Masoud, Reem I., et al.
Published: (2023)
DSO: Direct Steering Optimization for Bias Mitigation
by: Paes, Lucas Monteiro, et al.
Published: (2025)
by: Paes, Lucas Monteiro, et al.
Published: (2025)
What Large Language Models Do Not Talk About: An Empirical Study of Moderation and Censorship Practices
by: Noels, Sander, et al.
Published: (2025)
by: Noels, Sander, et al.
Published: (2025)
A Detailed Factor Analysis for the Political Compass Test: Navigating Ideologies of Large Language Models
by: Kamal, Sadia, et al.
Published: (2025)
by: Kamal, Sadia, et al.
Published: (2025)
Estimating Item Difficulty Using Large Language Models and Tree-Based Machine Learning Algorithms
by: Razavi, Pooya, et al.
Published: (2025)
by: Razavi, Pooya, et al.
Published: (2025)
The Moral Gap of Large Language Models
by: Skorski, Maciej, et al.
Published: (2025)
by: Skorski, Maciej, et al.
Published: (2025)
Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation
by: Vargas, Francisco, et al.
Published: (2020)
by: Vargas, Francisco, et al.
Published: (2020)
Correlated Errors in Large Language Models
by: Kim, Elliot, et al.
Published: (2025)
by: Kim, Elliot, et al.
Published: (2025)
Hypothesis Generation with Large Language Models
by: Zhou, Yangqiaoyu, et al.
Published: (2024)
by: Zhou, Yangqiaoyu, et al.
Published: (2024)
Large Language Models are Geographically Biased
by: Manvi, Rohin, et al.
Published: (2024)
by: Manvi, Rohin, et al.
Published: (2024)
Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment
by: Borchers, Conrad, et al.
Published: (2025)
by: Borchers, Conrad, et al.
Published: (2025)
Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)
GG-BBQ: German Gender Bias Benchmark for Question Answering
by: Satheesh, Shalaka, et al.
Published: (2025)
by: Satheesh, Shalaka, et al.
Published: (2025)
Similar Items
-
Fair Representation in Parliamentary Summaries: Measuring and Mitigating Inclusion Bias
by: Cunningham, Eoghan, et al.
Published: (2025) -
MetaWild: A Multimodal Dataset for Animal Re-Identification with Environmental Metadata
by: Li, Yuzhuo, et al.
Published: (2025) -
BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses
by: Xu, Xin, et al.
Published: (2025) -
Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
by: Kumar, Abhishek, et al.
Published: (2024) -
Bias and Fairness in Large Language Models: A Survey
by: Gallegos, Isabel O., et al.
Published: (2023)