Saved in:
| Main Authors: | Pan, Jinhao, Raj, Chahat, Yao, Ziyu, Zhu, Ziwei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.19749 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Bias Association Discovery Framework for Open-Ended LLM Generations
by: Pan, Jinhao, et al.
Published: (2025)
by: Pan, Jinhao, et al.
Published: (2025)
Talent or Luck? Evaluating Attribution Bias in Large Language Models
by: Raj, Chahat, et al.
Published: (2025)
by: Raj, Chahat, et al.
Published: (2025)
Breaking Bias, Building Bridges: Evaluation and Mitigation of Social Biases in LLMs via Contact Hypothesis
by: Raj, Chahat, et al.
Published: (2024)
by: Raj, Chahat, et al.
Published: (2024)
VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language Models
by: Raj, Chahat, et al.
Published: (2025)
by: Raj, Chahat, et al.
Published: (2025)
BiasDora: Exploring Hidden Biased Associations in Vision-Language Models
by: Raj, Chahat, et al.
Published: (2024)
by: Raj, Chahat, et al.
Published: (2024)
Purdah and Patriarchy: Evaluating and Mitigating South Asian Biases in Open-Ended Multilingual LLM Generations
by: Rinki, Mamnuya, et al.
Published: (2025)
by: Rinki, Mamnuya, et al.
Published: (2025)
KnowBias: Mitigating Social Bias in LLMs via Know-Bias Neuron Enhancement
by: Pan, Jinhao, et al.
Published: (2026)
by: Pan, Jinhao, et al.
Published: (2026)
Measuring Political Bias in Large Language Models: What Is Said and How It Is Said
by: Bang, Yejin, et al.
Published: (2024)
by: Bang, Yejin, et al.
Published: (2024)
Evaluating Social Bias in RAG Systems: When External Context Helps and Reasoning Hurts
by: Parihar, Shweta, et al.
Published: (2026)
by: Parihar, Shweta, et al.
Published: (2026)
They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References
by: Tripathi, Sahil, et al.
Published: (2026)
by: Tripathi, Sahil, et al.
Published: (2026)
Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models
by: Zhou, Yuqing, et al.
Published: (2024)
by: Zhou, Yuqing, et al.
Published: (2024)
Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models
by: Shin, Jisu, et al.
Published: (2024)
by: Shin, Jisu, et al.
Published: (2024)
When Alignment Hurts: Decoupling Representational Spaces in Multilingual Models
by: Elshabrawy, Ahmed, et al.
Published: (2025)
by: Elshabrawy, Ahmed, et al.
Published: (2025)
Trustworthy Social Bias Measurement
by: Bommasani, Rishi, et al.
Published: (2022)
by: Bommasani, Rishi, et al.
Published: (2022)
Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context
by: Pandey, Ashish, et al.
Published: (2026)
by: Pandey, Ashish, et al.
Published: (2026)
Probing Social Identity Bias in Chinese LLMs with Gendered Pronouns and Social Groups
by: Liu, Geng, et al.
Published: (2025)
by: Liu, Geng, et al.
Published: (2025)
FairCoder: Evaluating Social Bias of LLMs in Code Generation
by: Du, Yongkang, et al.
Published: (2025)
by: Du, Yongkang, et al.
Published: (2025)
CORTEX: Collaborative LLM Agents for High-Stakes Alert Triage
by: Wei, Bowen, et al.
Published: (2025)
by: Wei, Bowen, et al.
Published: (2025)
How Does Differential Privacy Affect Social Bias in LLMs? A Systematic Evaluation
by: Tenorio, Eduardo, et al.
Published: (2026)
by: Tenorio, Eduardo, et al.
Published: (2026)
FM SO.P: A Progressive Task Mixture Framework with Automatic Evaluation for Cross-Domain SOP Understanding
by: Huang, Siyuan, et al.
Published: (2026)
by: Huang, Siyuan, et al.
Published: (2026)
Obscured but Not Erased: Evaluating Nationality Bias in LLMs via Name-Based Bias Benchmarks
by: Pelosio, Giulio, et al.
Published: (2025)
by: Pelosio, Giulio, et al.
Published: (2025)
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
by: Fan, Zhiting, et al.
Published: (2024)
by: Fan, Zhiting, et al.
Published: (2024)
Instruction-Tuning LLMs for Event Extraction with Annotation Guidelines
by: Srivastava, Saurabh, et al.
Published: (2025)
by: Srivastava, Saurabh, et al.
Published: (2025)
A Scalable Entity-Based Framework for Auditing Bias in LLMs
by: Elbouanani, Akram, et al.
Published: (2026)
by: Elbouanani, Akram, et al.
Published: (2026)
When Reasoning Hurts: Source-Aware Evaluation of Frontier LLMs for Clinical SOAP Note Generation
by: Faisal, Faizan
Published: (2026)
by: Faisal, Faizan
Published: (2026)
Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study
by: Majumdar, Ayan, et al.
Published: (2025)
by: Majumdar, Ayan, et al.
Published: (2025)
Large Language Models Still Exhibit Bias in Long Text
by: Jeung, Wonje, et al.
Published: (2024)
by: Jeung, Wonje, et al.
Published: (2024)
A Japanese Benchmark for Evaluating Social Bias in Reasoning Based on Attribution Theory
by: Shiotani, Taihei, et al.
Published: (2026)
by: Shiotani, Taihei, et al.
Published: (2026)
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
by: Sun, Yiyou, et al.
Published: (2025)
by: Sun, Yiyou, et al.
Published: (2025)
SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs
by: Lin, Jiacheng, et al.
Published: (2025)
by: Lin, Jiacheng, et al.
Published: (2025)
Is a Peeled Apple Still Red? Evaluating LLMs' Ability for Conceptual Combination with Property Type
by: Song, Seokwon, et al.
Published: (2025)
by: Song, Seokwon, et al.
Published: (2025)
Large Language Models Are Still Misled by Simple Bias Ensembles
by: Sun, Zhouhao, et al.
Published: (2025)
by: Sun, Zhouhao, et al.
Published: (2025)
Relative Bias: A Comparative Framework for Quantifying Bias in LLMs
by: Arbabi, Alireza, et al.
Published: (2025)
by: Arbabi, Alireza, et al.
Published: (2025)
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs
by: Yin, Lake, et al.
Published: (2025)
by: Yin, Lake, et al.
Published: (2025)
A Dual-Layered Evaluation of Geopolitical and Cultural Bias in LLMs
by: Kim, Sean, et al.
Published: (2025)
by: Kim, Sean, et al.
Published: (2025)
User-Assistant Bias in LLMs
by: Pan, Xu, et al.
Published: (2025)
by: Pan, Xu, et al.
Published: (2025)
Evaluating Gender Bias of LLMs in Making Morality Judgements
by: Bajaj, Divij, et al.
Published: (2024)
by: Bajaj, Divij, et al.
Published: (2024)
IndiCASA: A Dataset and Bias Evaluation Framework in LLMs Using Contrastive Embedding Similarity in the Indian Context
by: S, Santhosh G, et al.
Published: (2025)
by: S, Santhosh G, et al.
Published: (2025)
Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations
by: Jin, Jiho, et al.
Published: (2025)
by: Jin, Jiho, et al.
Published: (2025)
Benford's Curse: Tracing Digit Bias to Numerical Hallucination in LLMs
by: Shao, Jiandong, et al.
Published: (2025)
by: Shao, Jiandong, et al.
Published: (2025)
Similar Items
-
Bias Association Discovery Framework for Open-Ended LLM Generations
by: Pan, Jinhao, et al.
Published: (2025) -
Talent or Luck? Evaluating Attribution Bias in Large Language Models
by: Raj, Chahat, et al.
Published: (2025) -
Breaking Bias, Building Bridges: Evaluation and Mitigation of Social Biases in LLMs via Contact Hypothesis
by: Raj, Chahat, et al.
Published: (2024) -
VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language Models
by: Raj, Chahat, et al.
Published: (2025) -
BiasDora: Exploring Hidden Biased Associations in Vision-Language Models
by: Raj, Chahat, et al.
Published: (2024)