Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Sumler, Daniel James, Devlin, Lee, Maskell, Simon, Lane, Richard O.
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2502.14545
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915163059257344
author	Sumler, Daniel James Devlin, Lee Maskell, Simon Lane, Richard O.
author_facet	Sumler, Daniel James Devlin, Lee Maskell, Simon Lane, Richard O.
contents	Understanding the confidence with which a machine learning model classifies an input datum is an important, and perhaps under-investigated, concept. In this paper, we propose a new calibration metric, the Entropic Calibration Difference (ECD). Based on existing research in the field of state estimation, specifically target tracking (TT), we show how ECD may be applied to binary classification machine learning models. We describe the relative importance of under- and over-confidence and how they are not conflated in the TT literature. Indeed, our metric distinguishes under- from over-confidence. We consider this important given that algorithms that are under-confident are likely to be 'safer' than algorithms that are over-confident, albeit at the expense of also being over-cautious and so statistically inefficient. We demonstrate how this new metric performs on real and simulated data and compare with other metrics for machine learning model probability calibration, including the Expected Calibration Error (ECE) and its signed counterpart, the Expected Signed Calibration Error (ESCE).
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_14545
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	An Entropic Metric for Measuring Calibration of Machine Learning Models Sumler, Daniel James Devlin, Lee Maskell, Simon Lane, Richard O. Machine Learning Understanding the confidence with which a machine learning model classifies an input datum is an important, and perhaps under-investigated, concept. In this paper, we propose a new calibration metric, the Entropic Calibration Difference (ECD). Based on existing research in the field of state estimation, specifically target tracking (TT), we show how ECD may be applied to binary classification machine learning models. We describe the relative importance of under- and over-confidence and how they are not conflated in the TT literature. Indeed, our metric distinguishes under- from over-confidence. We consider this important given that algorithms that are under-confident are likely to be 'safer' than algorithms that are over-confident, albeit at the expense of also being over-cautious and so statistically inefficient. We demonstrate how this new metric performs on real and simulated data and compare with other metrics for machine learning model probability calibration, including the Expected Calibration Error (ECE) and its signed counterpart, the Expected Signed Calibration Error (ESCE).
title	An Entropic Metric for Measuring Calibration of Machine Learning Models
topic	Machine Learning
url	https://arxiv.org/abs/2502.14545

Similar Items