Saved in:
Bibliographic Details
Main Authors: Benger, Etam, Ligett, Katrina
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.07285
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911730394726400
author Benger, Etam
Ligett, Katrina
author_facet Benger, Etam
Ligett, Katrina
contents Binary classification based on predicted probabilities (scores) is a fundamental task in supervised machine learning. While thresholding scores is Bayes-optimal in the unconstrained setting, using a single threshold generally violates statistical group fairness constraints. Under independence (statistical parity) and separation (equalized odds), such thresholding suffices when the scores already satisfy the corresponding criterion. However, this does not extend to sufficiency: even perfectly group-calibrated scores -- including true class probabilities -- violate predictive parity after thresholding. In this work, we present an exact solution for optimal binary (randomized) classification under sufficiency, assuming finite sets of group-calibrated scores. We provide a geometric characterization of the feasible pairs of positive predictive value (PPV) and false omission rate (FOR) achievable by such classifiers, and use it to derive a simple post-processing algorithm that attains the optimal classifier using only group-calibrated scores and group membership. Finally, since sufficiency and separation are generally incompatible, we identify the classifier that minimizes deviation from separation subject to sufficiency, and show that it can also be obtained by our algorithm, often achieving performance comparable to the optimum.
format Preprint
id arxiv_https___arxiv_org_abs_2602_07285
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Fair Decisions from Calibrated Scores: Achieving Optimal Classification While Satisfying Sufficiency
Benger, Etam
Ligett, Katrina
Machine Learning
Binary classification based on predicted probabilities (scores) is a fundamental task in supervised machine learning. While thresholding scores is Bayes-optimal in the unconstrained setting, using a single threshold generally violates statistical group fairness constraints. Under independence (statistical parity) and separation (equalized odds), such thresholding suffices when the scores already satisfy the corresponding criterion. However, this does not extend to sufficiency: even perfectly group-calibrated scores -- including true class probabilities -- violate predictive parity after thresholding. In this work, we present an exact solution for optimal binary (randomized) classification under sufficiency, assuming finite sets of group-calibrated scores. We provide a geometric characterization of the feasible pairs of positive predictive value (PPV) and false omission rate (FOR) achievable by such classifiers, and use it to derive a simple post-processing algorithm that attains the optimal classifier using only group-calibrated scores and group membership. Finally, since sufficiency and separation are generally incompatible, we identify the classifier that minimizes deviation from separation subject to sufficiency, and show that it can also be obtained by our algorithm, often achieving performance comparable to the optimum.
title Fair Decisions from Calibrated Scores: Achieving Optimal Classification While Satisfying Sufficiency
topic Machine Learning
url https://arxiv.org/abs/2602.07285