Saved in:
| Main Authors: | Berman, Eliza, Chang, Bella, Neill, Daniel B., Black, Emily |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.05224 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Generative Monoculture in Large Language Models
by: Wu, Fan, et al.
Published: (2024)
by: Wu, Fan, et al.
Published: (2024)
Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts
by: Lin, Yujie, et al.
Published: (2026)
by: Lin, Yujie, et al.
Published: (2026)
Contractual Deepfakes: Can Large Language Models Generate Contracts?
by: Mik, Eliza
Published: (2026)
by: Mik, Eliza
Published: (2026)
A Longitudinal Measurement of Privacy Policy Evolution for Large Language Models
by: Tao, Zhen, et al.
Published: (2025)
by: Tao, Zhen, et al.
Published: (2025)
Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models
by: Takagi, Hirohane, et al.
Published: (2025)
by: Takagi, Hirohane, et al.
Published: (2025)
CogBias: Measuring and Mitigating Cognitive Bias in Large Language Models
by: Huang, Fan, et al.
Published: (2026)
by: Huang, Fan, et al.
Published: (2026)
Do Large Language Models Know What They Are Capable Of?
by: Barkan, Casey O., et al.
Published: (2025)
by: Barkan, Casey O., et al.
Published: (2025)
BiasBusters: Uncovering and Mitigating Tool Selection Bias in Large Language Models
by: Blankenstein, Thierry, et al.
Published: (2025)
by: Blankenstein, Thierry, et al.
Published: (2025)
Backdooring Bias in Large Language Models
by: Das, Anudeep, et al.
Published: (2026)
by: Das, Anudeep, et al.
Published: (2026)
Regional Bias in Large Language Models
by: Gopinadh, M P V S, et al.
Published: (2026)
by: Gopinadh, M P V S, et al.
Published: (2026)
Exploring the Jungle of Bias: Political Bias Attribution in Language Models via Dependency Analysis
by: Jenny, David F., et al.
Published: (2023)
by: Jenny, David F., et al.
Published: (2023)
DataDignity: Training Data Attribution for Large Language Models
by: Li, Xiaomin, et al.
Published: (2026)
by: Li, Xiaomin, et al.
Published: (2026)
Source Attribution for Large Language Model-Generated Data
by: Wang, Jingtan, et al.
Published: (2023)
by: Wang, Jingtan, et al.
Published: (2023)
Assessing Political Bias in Large Language Models
by: Rettenberger, Luca, et al.
Published: (2024)
by: Rettenberger, Luca, et al.
Published: (2024)
Bias Amplification: Large Language Models as Increasingly Biased Media
by: Wang, Ze, et al.
Published: (2024)
by: Wang, Ze, et al.
Published: (2024)
Analyzing Memorization in Large Language Models through the Lens of Model Attribution
by: Menta, Tarun Ram, et al.
Published: (2025)
by: Menta, Tarun Ram, et al.
Published: (2025)
Noiser: Bounded Input Perturbations for Attributing Large Language Models
by: Madani, Mohammad Reza Ghasemi, et al.
Published: (2025)
by: Madani, Mohammad Reza Ghasemi, et al.
Published: (2025)
Explaining the Reasoning of Large Language Models Using Attribution Graphs
by: Walker, Chase, et al.
Published: (2025)
by: Walker, Chase, et al.
Published: (2025)
Advancing Large Language Model Attribution through Self-Improving
by: Huang, Lei, et al.
Published: (2024)
by: Huang, Lei, et al.
Published: (2024)
Locating and Mitigating Gender Bias in Large Language Models
by: Cai, Yuchen, et al.
Published: (2024)
by: Cai, Yuchen, et al.
Published: (2024)
Cultural Bias and Cultural Alignment of Large Language Models
by: Tao, Yan, et al.
Published: (2023)
by: Tao, Yan, et al.
Published: (2023)
Adaptive Multi-Subspace Representation Steering for Attribute Alignment in Large Language Models
by: Jiang, Xinyan, et al.
Published: (2025)
by: Jiang, Xinyan, et al.
Published: (2025)
Cross-Language Bias Examination in Large Language Models
by: Liang, Yuxuan, et al.
Published: (2025)
by: Liang, Yuxuan, et al.
Published: (2025)
Using Protected Attributes to Consider Fairness in Multi-Agent Systems
by: La Malfa, Gabriele, et al.
Published: (2024)
by: La Malfa, Gabriele, et al.
Published: (2024)
Quantifying Self-Preservation Bias in Large Language Models
by: Migliarini, Matteo, et al.
Published: (2026)
by: Migliarini, Matteo, et al.
Published: (2026)
Assessing the Creativity of Large Language Models: Testing, Limits, and New Frontiers
by: Schapiro, Samuel, et al.
Published: (2026)
by: Schapiro, Samuel, et al.
Published: (2026)
No LLM is Free From Bias: A Comprehensive Study of Bias Evaluation in Large Language Models
by: Kumar, Charaka Vinayak, et al.
Published: (2025)
by: Kumar, Charaka Vinayak, et al.
Published: (2025)
Document Attribution: Examining Citation Relationships using Large Language Models
by: Rawte, Vipula, et al.
Published: (2025)
by: Rawte, Vipula, et al.
Published: (2025)
Revisiting Large Language Model Pruning using Neuron Semantic Attribution
by: Ding, Yizhuo, et al.
Published: (2025)
by: Ding, Yizhuo, et al.
Published: (2025)
Learning Fine-Grained Grounded Citations for Attributed Large Language Models
by: Huang, Lei, et al.
Published: (2024)
by: Huang, Lei, et al.
Published: (2024)
Behavioral Bias of Vision-Language Models: A Behavioral Finance View
by: Xiao, Yuhang, et al.
Published: (2024)
by: Xiao, Yuhang, et al.
Published: (2024)
Multi-Persona Thinking for Bias Mitigation in Large Language Models
by: Chen, Yuxing, et al.
Published: (2026)
by: Chen, Yuxing, et al.
Published: (2026)
Measuring and Mitigating Bias in Code Generated by Large Language Models
by: Chen, Yuxi, et al.
Published: (2026)
by: Chen, Yuxi, et al.
Published: (2026)
Prompt Programming for Cultural Bias and Alignment of Large Language Models
by: Eren, Maksim, et al.
Published: (2026)
by: Eren, Maksim, et al.
Published: (2026)
Likelihood-based Mitigation of Evaluation Bias in Large Language Models
by: Oi, Masanari, et al.
Published: (2024)
by: Oi, Masanari, et al.
Published: (2024)
Mitigating Propensity Bias of Large Language Models for Recommender Systems
by: Zhang, Guixian, et al.
Published: (2024)
by: Zhang, Guixian, et al.
Published: (2024)
Large Language Models Are Still Misled by Simple Bias Ensembles
by: Sun, Zhouhao, et al.
Published: (2025)
by: Sun, Zhouhao, et al.
Published: (2025)
Evaluation of Bias Towards Medical Professionals in Large Language Models
by: Chen, Xi, et al.
Published: (2024)
by: Chen, Xi, et al.
Published: (2024)
Uncovering Political Bias in Large Language Models using Parliamentary Voting Records
by: Chen, Jieying, et al.
Published: (2026)
by: Chen, Jieying, et al.
Published: (2026)
Bias of AI-Generated Content: An Examination of News Produced by Large Language Models
by: Fang, Xiao, et al.
Published: (2023)
by: Fang, Xiao, et al.
Published: (2023)
Similar Items
-
Generative Monoculture in Large Language Models
by: Wu, Fan, et al.
Published: (2024) -
Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts
by: Lin, Yujie, et al.
Published: (2026) -
Contractual Deepfakes: Can Large Language Models Generate Contracts?
by: Mik, Eliza
Published: (2026) -
A Longitudinal Measurement of Privacy Policy Evolution for Large Language Models
by: Tao, Zhen, et al.
Published: (2025) -
Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models
by: Takagi, Hirohane, et al.
Published: (2025)