Saved in:
Bibliographic Details
Main Authors: Huang, Siguang, Wang, Yunli, Mou, Lili, Zhang, Huayue, Zhu, Han, Yu, Chuan, Zheng, Bo
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2202.04348
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929754358153216
author Huang, Siguang
Wang, Yunli
Mou, Lili
Zhang, Huayue
Zhu, Han
Yu, Chuan
Zheng, Bo
author_facet Huang, Siguang
Wang, Yunli
Mou, Lili
Zhang, Huayue
Zhu, Han
Yu, Chuan
Zheng, Bo
contents Most machine learning classifiers only concern classification accuracy, while certain applications (such as medical diagnosis, meteorological forecasting, and computation advertising) require the model to predict the true probability, known as a calibrated estimate. In previous work, researchers have developed several calibration methods to post-process the outputs of a predictor to obtain calibrated values, such as binning and scaling methods. Compared with scaling, binning methods are shown to have distribution-free theoretical guarantees, which motivates us to prefer binning methods for calibration. However, we notice that existing binning methods have several drawbacks: (a) the binning scheme only considers the original prediction values, thus limiting the calibration performance; and (b) the binning approach is non-individual, mapping multiple samples in a bin to the same value, and thus is not suitable for order-sensitive applications. In this paper, we propose a feature-aware binning framework, called Multiple Boosting Calibration Trees (MBCT), along with a multi-view calibration loss to tackle the above issues. Our MBCT optimizes the binning scheme by the tree structures of features, and adopts a linear function in a tree node to achieve individual calibration. Our MBCT is non-monotonic, and has the potential to improve order accuracy, due to its learnable binning scheme and the individual calibration. We conduct comprehensive experiments on three datasets in different fields. Results show that our method outperforms all competing models in terms of both calibration error and order accuracy. We also conduct simulation experiments, justifying that the proposed multi-view calibration loss is a better metric in modeling calibration error.
format Preprint
id arxiv_https___arxiv_org_abs_2202_04348
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle MBCT: Tree-Based Feature-Aware Binning for Individual Uncertainty Calibration
Huang, Siguang
Wang, Yunli
Mou, Lili
Zhang, Huayue
Zhu, Han
Yu, Chuan
Zheng, Bo
Machine Learning
Most machine learning classifiers only concern classification accuracy, while certain applications (such as medical diagnosis, meteorological forecasting, and computation advertising) require the model to predict the true probability, known as a calibrated estimate. In previous work, researchers have developed several calibration methods to post-process the outputs of a predictor to obtain calibrated values, such as binning and scaling methods. Compared with scaling, binning methods are shown to have distribution-free theoretical guarantees, which motivates us to prefer binning methods for calibration. However, we notice that existing binning methods have several drawbacks: (a) the binning scheme only considers the original prediction values, thus limiting the calibration performance; and (b) the binning approach is non-individual, mapping multiple samples in a bin to the same value, and thus is not suitable for order-sensitive applications. In this paper, we propose a feature-aware binning framework, called Multiple Boosting Calibration Trees (MBCT), along with a multi-view calibration loss to tackle the above issues. Our MBCT optimizes the binning scheme by the tree structures of features, and adopts a linear function in a tree node to achieve individual calibration. Our MBCT is non-monotonic, and has the potential to improve order accuracy, due to its learnable binning scheme and the individual calibration. We conduct comprehensive experiments on three datasets in different fields. Results show that our method outperforms all competing models in terms of both calibration error and order accuracy. We also conduct simulation experiments, justifying that the proposed multi-view calibration loss is a better metric in modeling calibration error.
title MBCT: Tree-Based Feature-Aware Binning for Individual Uncertainty Calibration
topic Machine Learning
url https://arxiv.org/abs/2202.04348