Saved in:
Bibliographic Details
Main Authors: Gruber, Sebastian G., Bach, Francis
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.07014
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910838598664192
author Gruber, Sebastian G.
Bach, Francis
author_facet Gruber, Sebastian G.
Bach, Francis
contents In this work, we propose a mean-squared error-based risk that enables the comparison and optimization of estimators of squared calibration errors in practical settings. Improving the calibration of classifiers is crucial for enhancing the trustworthiness and interpretability of machine learning models, especially in sensitive decision-making scenarios. Although various calibration (error) estimators exist in the current literature, there is a lack of guidance on selecting the appropriate estimator and tuning its hyperparameters. By leveraging the bilinear structure of squared calibration errors, we reformulate calibration estimation as a regression problem with independent and identically distributed (i.i.d.) input pairs. This reformulation allows us to quantify the performance of different estimators even for the most challenging calibration criterion, known as canonical calibration. Our approach advocates for a training-validation-testing pipeline when estimating a calibration error on an evaluation dataset. We demonstrate the effectiveness of our pipeline by optimizing existing calibration estimators and comparing them with novel kernel ridge regression-based estimators on standard image classification tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2410_07014
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Optimizing Estimators of Squared Calibration Errors in Classification
Gruber, Sebastian G.
Bach, Francis
Machine Learning
In this work, we propose a mean-squared error-based risk that enables the comparison and optimization of estimators of squared calibration errors in practical settings. Improving the calibration of classifiers is crucial for enhancing the trustworthiness and interpretability of machine learning models, especially in sensitive decision-making scenarios. Although various calibration (error) estimators exist in the current literature, there is a lack of guidance on selecting the appropriate estimator and tuning its hyperparameters. By leveraging the bilinear structure of squared calibration errors, we reformulate calibration estimation as a regression problem with independent and identically distributed (i.i.d.) input pairs. This reformulation allows us to quantify the performance of different estimators even for the most challenging calibration criterion, known as canonical calibration. Our approach advocates for a training-validation-testing pipeline when estimating a calibration error on an evaluation dataset. We demonstrate the effectiveness of our pipeline by optimizing existing calibration estimators and comparing them with novel kernel ridge regression-based estimators on standard image classification tasks.
title Optimizing Estimators of Squared Calibration Errors in Classification
topic Machine Learning
url https://arxiv.org/abs/2410.07014