| Main Authors: | Merchant, Humzah; Levy, Bradford |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2512.06607 |
Similar Items
Divergence Decoding: Inference-Time Unlearning via Auxiliary Models
by: Merchant, Humzah, et al.
Published: (2026)
Look-Ahead-Bench: a Standardized Benchmark of Look-ahead Bias in Point-in-Time LLMs for Finance
by: Benhenda, Mostapha
Published: (2026)
Psychological Profiling in Cybersecurity: A Look at LLMs and Psycholinguistic Features
by: Tshimula, Jean Marie, et al.
Published: (2024)
Do language models plan ahead for future tokens?
by: Wu, Wilson, et al.
Published: (2024)
Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias
by: Chen, Yuen, et al.
Published: (2022)
Investigating Bias: A Multilingual Pipeline for Generating, Solving, and Evaluating Math Problems with LLMs
by: Mahran, Mariam, et al.
Published: (2025)
Looking for the Inner Music: Probing LLMs' Understanding of Literary Style
by: Hicke, Rebecca M. M., et al.
Published: (2025)
Relative Bias: A Comparative Framework for Quantifying Bias in LLMs
by: Arbabi, Alireza, et al.
Published: (2025)
The Confidence Trap: Gender Bias and Predictive Certainty in LLMs
by: Sabir, Ahmed, et al.
Published: (2026)
When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes
by: Yehudai, Asaf, et al.
Published: (2024)
WRAVAL -- WRiting Assist eVALuation
by: Benedict, Gabriel, et al.
Published: (2025)
Bias patterns in the application of LLMs for clinical decision support: A comprehensive study
by: Poulain, Raphael, et al.
Published: (2024)
The Impact of Inference Acceleration on Bias of LLMs
by: Kirsten, Elisabeth, et al.
Published: (2024)
Format as a Prior: Quantifying and Analyzing Bias in LLMs for Heterogeneous Data
by: Liu, Jiacheng, et al.
Published: (2025)
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
by: Zhang, Qi, et al.
Published: (2024)
Fast and Effective Weight Update for Pruned Large Language Models
by: Boža, Vladimír
Published: (2024)
Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution
by: Xu, Nuo, et al.
Published: (2024)
Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study
by: Majumdar, Ayan, et al.
Published: (2025)
Bias in LLMs as Annotators: The Effect of Party Cues on Labelling Decision by Large Language Models
by: Vera, Sebastian Vallejo, et al.
Published: (2024)
Assessing Gender Bias in LLMs: Comparing LLM Outputs with Human Perceptions and Official Statistics
by: Bas, Tetiana
Published: (2024)
Reasoning Beyond Limits: Advances and Open Problems for LLMs
by: Ferrag, Mohamed Amine, et al.
Published: (2025)
Long-Short Alignment for Effective Long-Context Modeling in LLMs
by: Du, Tianqi, et al.
Published: (2025)
Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs
by: Chan, Yung-Chieh, et al.
Published: (2024)
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
by: Deschenaux, Justin, et al.
Published: (2024)
Guiding LLMs The Right Way: Fast, Non-Invasive Constrained Generation
by: Beurer-Kellner, Luca, et al.
Published: (2024)
Beyond Early-Token Bias: Model-Specific and Language-Specific Position Effects in Multilingual LLMs
by: Menschikov, Mikhail, et al.
Published: (2025)
Bias Amplification in RAG: Poisoning Knowledge Retrieval to Steer LLMs
by: Wang, Linlin, et al.
Published: (2025)
Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
by: Siddique, Zara, et al.
Published: (2025)
AGR: Age Group fairness Reward for Bias Mitigation in LLMs
by: Cao, Shuirong, et al.
Published: (2024)
Bias Similarity Measurement: A Black-Box Audit of Fairness Across LLMs
by: Jeong, Hyejun, et al.
Published: (2024)
Cost-Effective Hallucination Detection for LLMs
by: Valentin, Simon, et al.
Published: (2024)
XFinBench: Benchmarking LLMs in Complex Financial Problem Solving and Reasoning
by: Zhang, Zhihan, et al.
Published: (2025)
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
by: Hu, Zhiyuan, et al.
Published: (2026)
Easy Problems That LLMs Get Wrong
by: Williams, Sean, et al.
Published: (2024)
Reasoning LLMs are Wandering Solution Explorers
by: Lu, Jiahao, et al.
Published: (2025)
Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs
by: Gao, Xin, et al.
Published: (2025)
Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification
by: Bhattacharjee, Payel, et al.
Published: (2025)
Evaluating LLMs in Finance Requires Explicit Bias Consideration
by: Kong, Yaxuan, et al.
Published: (2026)
A Critical Look At Tokenwise Reward-Guided Text Generation
by: Rashid, Ahmad, et al.
Published: (2024)
A Closer Look into Mixture-of-Experts in Large Language Models
by: Lo, Ka Man, et al.
Published: (2024)