:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Pan, Jinhao, Raj, Chahat, Yao, Ziyu, Zhu, Ziwei
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.19749
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Bias Association Discovery Framework for Open-Ended LLM Generations
by: Pan, Jinhao, et al.
Published: (2025)

Talent or Luck? Evaluating Attribution Bias in Large Language Models
by: Raj, Chahat, et al.
Published: (2025)

Breaking Bias, Building Bridges: Evaluation and Mitigation of Social Biases in LLMs via Contact Hypothesis
by: Raj, Chahat, et al.
Published: (2024)

VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language Models
by: Raj, Chahat, et al.
Published: (2025)

BiasDora: Exploring Hidden Biased Associations in Vision-Language Models
by: Raj, Chahat, et al.
Published: (2024)

Purdah and Patriarchy: Evaluating and Mitigating South Asian Biases in Open-Ended Multilingual LLM Generations
by: Rinki, Mamnuya, et al.
Published: (2025)

KnowBias: Mitigating Social Bias in LLMs via Know-Bias Neuron Enhancement
by: Pan, Jinhao, et al.
Published: (2026)

Measuring Political Bias in Large Language Models: What Is Said and How It Is Said
by: Bang, Yejin, et al.
Published: (2024)

Evaluating Social Bias in RAG Systems: When External Context Helps and Reasoning Hurts
by: Parihar, Shweta, et al.
Published: (2026)

They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References
by: Tripathi, Sahil, et al.
Published: (2026)

Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models
by: Zhou, Yuqing, et al.
Published: (2024)

Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models
by: Shin, Jisu, et al.
Published: (2024)

When Alignment Hurts: Decoupling Representational Spaces in Multilingual Models
by: Elshabrawy, Ahmed, et al.
Published: (2025)

Trustworthy Social Bias Measurement
by: Bommasani, Rishi, et al.
Published: (2022)

Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context
by: Pandey, Ashish, et al.
Published: (2026)

Probing Social Identity Bias in Chinese LLMs with Gendered Pronouns and Social Groups
by: Liu, Geng, et al.
Published: (2025)

FairCoder: Evaluating Social Bias of LLMs in Code Generation
by: Du, Yongkang, et al.
Published: (2025)

CORTEX: Collaborative LLM Agents for High-Stakes Alert Triage
by: Wei, Bowen, et al.
Published: (2025)

How Does Differential Privacy Affect Social Bias in LLMs? A Systematic Evaluation
by: Tenorio, Eduardo, et al.
Published: (2026)

FM SO.P: A Progressive Task Mixture Framework with Automatic Evaluation for Cross-Domain SOP Understanding
by: Huang, Siyuan, et al.
Published: (2026)

Obscured but Not Erased: Evaluating Nationality Bias in LLMs via Name-Based Bias Benchmarks
by: Pelosio, Giulio, et al.
Published: (2025)

BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
by: Fan, Zhiting, et al.
Published: (2024)

Instruction-Tuning LLMs for Event Extraction with Annotation Guidelines
by: Srivastava, Saurabh, et al.
Published: (2025)

A Scalable Entity-Based Framework for Auditing Bias in LLMs
by: Elbouanani, Akram, et al.
Published: (2026)

When Reasoning Hurts: Source-Aware Evaluation of Frontier LLMs for Clinical SOAP Note Generation
by: Faisal, Faizan
Published: (2026)

Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study
by: Majumdar, Ayan, et al.
Published: (2025)

Large Language Models Still Exhibit Bias in Long Text
by: Jeung, Wonje, et al.
Published: (2024)

A Japanese Benchmark for Evaluating Social Bias in Reasoning Based on Attribution Theory
by: Shiotani, Taihei, et al.
Published: (2026)

Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
by: Sun, Yiyou, et al.
Published: (2025)

SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs
by: Lin, Jiacheng, et al.
Published: (2025)

Is a Peeled Apple Still Red? Evaluating LLMs' Ability for Conceptual Combination with Property Type
by: Song, Seokwon, et al.
Published: (2025)

Large Language Models Are Still Misled by Simple Bias Ensembles
by: Sun, Zhouhao, et al.
Published: (2025)

Relative Bias: A Comparative Framework for Quantifying Bias in LLMs
by: Arbabi, Alireza, et al.
Published: (2025)

DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs
by: Yin, Lake, et al.
Published: (2025)

A Dual-Layered Evaluation of Geopolitical and Cultural Bias in LLMs
by: Kim, Sean, et al.
Published: (2025)

User-Assistant Bias in LLMs
by: Pan, Xu, et al.
Published: (2025)

Evaluating Gender Bias of LLMs in Making Morality Judgements
by: Bajaj, Divij, et al.
Published: (2024)

IndiCASA: A Dataset and Bias Evaluation Framework in LLMs Using Contrastive Embedding Similarity in the Indian Context
by: S, Santhosh G, et al.
Published: (2025)

Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations
by: Jin, Jiho, et al.
Published: (2025)

Benford's Curse: Tracing Digit Bias to Numerical Hallucination in LLMs
by: Shao, Jiandong, et al.
Published: (2025)