:: Library Catalog

Bibliographic Details
Main Authors:	Merchant, Humzah; Levy, Bradford
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Computation and Language
Online Access:	https://arxiv.org/abs/2512.06607

Similar Items

Divergence Decoding: Inference-Time Unlearning via Auxiliary Models
by: Merchant, Humzah, et al.
Published: (2026)

Look-Ahead-Bench: a Standardized Benchmark of Look-ahead Bias in Point-in-Time LLMs for Finance
by: Benhenda, Mostapha
Published: (2026)

Psychological Profiling in Cybersecurity: A Look at LLMs and Psycholinguistic Features
by: Tshimula, Jean Marie, et al.
Published: (2024)

Do language models plan ahead for future tokens?
by: Wu, Wilson, et al.
Published: (2024)

Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias
by: Chen, Yuen, et al.
Published: (2022)

Investigating Bias: A Multilingual Pipeline for Generating, Solving, and Evaluating Math Problems with LLMs
by: Mahran, Mariam, et al.
Published: (2025)

Looking for the Inner Music: Probing LLMs' Understanding of Literary Style
by: Hicke, Rebecca M. M., et al.
Published: (2025)

Relative Bias: A Comparative Framework for Quantifying Bias in LLMs
by: Arbabi, Alireza, et al.
Published: (2025)

The Confidence Trap: Gender Bias and Predictive Certainty in LLMs
by: Sabir, Ahmed, et al.
Published: (2026)

When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes
by: Yehudai, Asaf, et al.
Published: (2024)

WRAVAL -- WRiting Assist eVALuation
by: Benedict, Gabriel, et al.
Published: (2025)

Bias patterns in the application of LLMs for clinical decision support: A comprehensive study
by: Poulain, Raphael, et al.
Published: (2024)

The Impact of Inference Acceleration on Bias of LLMs
by: Kirsten, Elisabeth, et al.
Published: (2024)

Format as a Prior: Quantifying and Analyzing Bias in LLMs for Heterogeneous Data
by: Liu, Jiacheng, et al.
Published: (2025)

Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
by: Zhang, Qi, et al.
Published: (2024)

Fast and Effective Weight Update for Pruned Large Language Models
by: Boža, Vladimír
Published: (2024)

Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution
by: Xu, Nuo, et al.
Published: (2024)

Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study
by: Majumdar, Ayan, et al.
Published: (2025)

Bias in LLMs as Annotators: The Effect of Party Cues on Labelling Decision by Large Language Models
by: Vera, Sebastian Vallejo, et al.
Published: (2024)

Assessing Gender Bias in LLMs: Comparing LLM Outputs with Human Perceptions and Official Statistics
by: Bas, Tetiana
Published: (2024)

Reasoning Beyond Limits: Advances and Open Problems for LLMs
by: Ferrag, Mohamed Amine, et al.
Published: (2025)

Long-Short Alignment for Effective Long-Context Modeling in LLMs
by: Du, Tianqi, et al.
Published: (2025)

Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs
by: Chan, Yung-Chieh, et al.
Published: (2024)

Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
by: Deschenaux, Justin, et al.
Published: (2024)

Guiding LLMs The Right Way: Fast, Non-Invasive Constrained Generation
by: Beurer-Kellner, Luca, et al.
Published: (2024)

Beyond Early-Token Bias: Model-Specific and Language-Specific Position Effects in Multilingual LLMs
by: Menschikov, Mikhail, et al.
Published: (2025)

Bias Amplification in RAG: Poisoning Knowledge Retrieval to Steer LLMs
by: Wang, Linlin, et al.
Published: (2025)

Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
by: Siddique, Zara, et al.
Published: (2025)

AGR: Age Group fairness Reward for Bias Mitigation in LLMs
by: Cao, Shuirong, et al.
Published: (2024)

Bias Similarity Measurement: A Black-Box Audit of Fairness Across LLMs
by: Jeong, Hyejun, et al.
Published: (2024)

Cost-Effective Hallucination Detection for LLMs
by: Valentin, Simon, et al.
Published: (2024)

XFinBench: Benchmarking LLMs in Complex Financial Problem Solving and Reasoning
by: Zhang, Zhihan, et al.
Published: (2025)

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
by: Hu, Zhiyuan, et al.
Published: (2026)

Easy Problems That LLMs Get Wrong
by: Williams, Sean, et al.
Published: (2024)

Reasoning LLMs are Wandering Solution Explorers
by: Lu, Jiahao, et al.
Published: (2025)

Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs
by: Gao, Xin, et al.
Published: (2025)

Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification
by: Bhattacharjee, Payel, et al.
Published: (2025)

Evaluating LLMs in Finance Requires Explicit Bias Consideration
by: Kong, Yaxuan, et al.
Published: (2026)

A Critical Look At Tokenwise Reward-Guided Text Generation
by: Rashid, Ahmad, et al.
Published: (2024)

A Closer Look into Mixture-of-Experts in Large Language Models
by: Lo, Ka Man, et al.
Published: (2024)