Saved in:
| Main Authors: | Zhou, Ej, Lu, Weiming |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.11183 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Gaps between Pre-train and Downstream Settings in Bias Evaluation and Debiasing
by: Kaneko, Masahiro, et al.
Published: (2024)
by: Kaneko, Masahiro, et al.
Published: (2024)
OffsetBias: Leveraging Debiased Data for Tuning Evaluators
by: Park, Junsoo, et al.
Published: (2024)
by: Park, Junsoo, et al.
Published: (2024)
Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization
by: Zhou, Hongli, et al.
Published: (2026)
by: Zhou, Hongli, et al.
Published: (2026)
Beyond English: Unveiling Multilingual Bias in LLM Copyright Compliance
by: Chen, Yupeng, et al.
Published: (2025)
by: Chen, Yupeng, et al.
Published: (2025)
On Bias and Fairness in NLP: Investigating the Impact of Bias and Debiasing in Language Models on the Fairness of Toxicity Detection
by: Elsafoury, Fatma, et al.
Published: (2023)
by: Elsafoury, Fatma, et al.
Published: (2023)
Evaluating and Mitigating Social Bias for Large Language Models in Open-ended Settings
by: Liu, Zhao, et al.
Published: (2024)
by: Liu, Zhao, et al.
Published: (2024)
Debiasing CLIP: Interpreting and Correcting Bias in Attention Heads
by: Yeo, Wei Jie, et al.
Published: (2025)
by: Yeo, Wei Jie, et al.
Published: (2025)
Are Bias Evaluation Methods Biased ?
by: Berrayana, Lina, et al.
Published: (2025)
by: Berrayana, Lina, et al.
Published: (2025)
Towards Multimodal Sentiment Analysis Debiasing via Bias Purification
by: Yang, Dingkang, et al.
Published: (2024)
by: Yang, Dingkang, et al.
Published: (2024)
BiasFilter: An Inference-Time Debiasing Framework for Large Language Models
by: Cheng, Xiaoqing, et al.
Published: (2025)
by: Cheng, Xiaoqing, et al.
Published: (2025)
Unboxing Occupational Bias: Grounded Debiasing of LLMs with U.S. Labor Data
by: Gorti, Atmika, et al.
Published: (2024)
by: Gorti, Atmika, et al.
Published: (2024)
FairCoder: Evaluating Social Bias of LLMs in Code Generation
by: Du, Yongkang, et al.
Published: (2025)
by: Du, Yongkang, et al.
Published: (2025)
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
by: Liu, Guangliang, et al.
Published: (2024)
by: Liu, Guangliang, et al.
Published: (2024)
Open-DeBias: Toward Mitigating Open-Set Bias in Language Models
by: Rani, Arti, et al.
Published: (2025)
by: Rani, Arti, et al.
Published: (2025)
Gender Bias in English-to-Greek Machine Translation
by: Gkovedarou, Eleni, et al.
Published: (2025)
by: Gkovedarou, Eleni, et al.
Published: (2025)
BiasEdit: Debiasing Stereotyped Language Models via Model Editing
by: Xu, Xin, et al.
Published: (2025)
by: Xu, Xin, et al.
Published: (2025)
Beyond the Final Layer: Intermediate Representations for Better Multilingual Calibration in Large Language Models
by: Zhou, Ej, et al.
Published: (2025)
by: Zhou, Ej, et al.
Published: (2025)
Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies
by: Abrar, Ajwad, et al.
Published: (2025)
by: Abrar, Ajwad, et al.
Published: (2025)
Mind the Language Gap: Automated and Augmented Evaluation of Bias in LLMs for High- and Low-Resource Languages
by: Buscemi, Alessio, et al.
Published: (2025)
by: Buscemi, Alessio, et al.
Published: (2025)
Does Reasoning Introduce Bias? A Study of Social Bias Evaluation and Mitigation in LLM Reasoning
by: Wu, Xuyang, et al.
Published: (2025)
by: Wu, Xuyang, et al.
Published: (2025)
Evaluating Social Bias in RAG Systems: When External Context Helps and Reasoning Hurts
by: Parihar, Shweta, et al.
Published: (2026)
by: Parihar, Shweta, et al.
Published: (2026)
Trustworthy Social Bias Measurement
by: Bommasani, Rishi, et al.
Published: (2022)
by: Bommasani, Rishi, et al.
Published: (2022)
VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language Models
by: Raj, Chahat, et al.
Published: (2025)
by: Raj, Chahat, et al.
Published: (2025)
FIBER: A Multilingual Evaluation Resource for Factual Inference Bias
by: Munis, Evren Ayberk, et al.
Published: (2025)
by: Munis, Evren Ayberk, et al.
Published: (2025)
From Measurement to Mitigation: Exploring the Transferability of Debiasing Approaches to Gender Bias in Maltese Language Models
by: Galea, Melanie, et al.
Published: (2025)
by: Galea, Melanie, et al.
Published: (2025)
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
by: Fan, Zhiting, et al.
Published: (2024)
by: Fan, Zhiting, et al.
Published: (2024)
Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector
by: Yang, Haoyan, et al.
Published: (2025)
by: Yang, Haoyan, et al.
Published: (2025)
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
by: Lum, Kristian, et al.
Published: (2024)
by: Lum, Kristian, et al.
Published: (2024)
Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts
by: Lin, Yujie, et al.
Published: (2026)
by: Lin, Yujie, et al.
Published: (2026)
Towards Resource Efficient and Interpretable Bias Mitigation in Large Language Models
by: Tong, Schrasing, et al.
Published: (2024)
by: Tong, Schrasing, et al.
Published: (2024)
Social Bias Evaluation for Large Language Models Requires Prompt Variations
by: Hida, Rem, et al.
Published: (2024)
by: Hida, Rem, et al.
Published: (2024)
Exploring Gender Bias Beyond Occupational Titles
by: Sabir, Ahmed, et al.
Published: (2025)
by: Sabir, Ahmed, et al.
Published: (2025)
Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias
by: Sadhu, Jayanta, et al.
Published: (2024)
by: Sadhu, Jayanta, et al.
Published: (2024)
Mitigating the Bias of Large Language Model Evaluation
by: Zhou, Hongli, et al.
Published: (2024)
by: Zhou, Hongli, et al.
Published: (2024)
Bias Beyond Borders: Political Ideology Evaluation and Steering in Multilingual LLMs
by: Nadeem, Afrozah, et al.
Published: (2026)
by: Nadeem, Afrozah, et al.
Published: (2026)
Veracity Bias and Beyond: Uncovering LLMs' Hidden Beliefs in Problem-Solving Reasoning
by: Zhou, Yue, et al.
Published: (2025)
by: Zhou, Yue, et al.
Published: (2025)
Bias Dynamics in BabyLMs: Towards a Compute-Efficient Sandbox for Democratising Pre-Training Debiasing
by: Trhlik, Filip, et al.
Published: (2026)
by: Trhlik, Filip, et al.
Published: (2026)
Mitigating Social Bias in English and Urdu Language Models Using PRM-Guided Candidate Selection and Sequential Refinement
by: Khan, Muneeb Ur Raheem
Published: (2025)
by: Khan, Muneeb Ur Raheem
Published: (2025)
Evaluating Scoring Bias in LLM-as-a-Judge
by: Li, Qingquan, et al.
Published: (2025)
by: Li, Qingquan, et al.
Published: (2025)
BiasCause: Evaluate Socially Biased Causal Reasoning of Large Language Models
by: Xie, Tian, et al.
Published: (2025)
by: Xie, Tian, et al.
Published: (2025)
Similar Items
-
The Gaps between Pre-train and Downstream Settings in Bias Evaluation and Debiasing
by: Kaneko, Masahiro, et al.
Published: (2024) -
OffsetBias: Leveraging Debiased Data for Tuning Evaluators
by: Park, Junsoo, et al.
Published: (2024) -
Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization
by: Zhou, Hongli, et al.
Published: (2026) -
Beyond English: Unveiling Multilingual Bias in LLM Copyright Compliance
by: Chen, Yupeng, et al.
Published: (2025) -
On Bias and Fairness in NLP: Investigating the Impact of Bias and Debiasing in Language Models on the Fairness of Toxicity Detection
by: Elsafoury, Fatma, et al.
Published: (2023)