Saved in:
Bibliographic Details
Main Authors: Esaki, Yasushi, Nakamura, Akihiro, Kawano, Keisuke, Tokuhisa, Ryoko, Kutsuna, Takuro
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.13765
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912278723428352
author Esaki, Yasushi
Nakamura, Akihiro
Kawano, Keisuke
Tokuhisa, Ryoko
Kutsuna, Takuro
author_facet Esaki, Yasushi
Nakamura, Akihiro
Kawano, Keisuke
Tokuhisa, Ryoko
Kutsuna, Takuro
contents Classification models based on deep neural networks (DNNs) must be calibrated to measure the reliability of predictions. Some recent calibration methods have employed a probabilistic model on the probability simplex. However, these calibration methods cannot preserve the accuracy of pre-trained models, even those with a high classification accuracy. We propose an accuracy-preserving calibration method using the Concrete distribution as the probabilistic model on the probability simplex. We theoretically prove that a DNN model trained on cross-entropy loss has optimality as the parameter of the Concrete distribution. We also propose an efficient method that synthetically generates samples for training probabilistic models on the probability simplex. We demonstrate that the proposed method can outperform previous methods in accuracy-preserving calibration tasks using benchmarks. The code is available at https://github.com/ToyotaCRDL/SimplexTS.
format Preprint
id arxiv_https___arxiv_org_abs_2402_13765
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Accuracy-Preserving Calibration via Statistical Modeling on Probability Simplex
Esaki, Yasushi
Nakamura, Akihiro
Kawano, Keisuke
Tokuhisa, Ryoko
Kutsuna, Takuro
Machine Learning
Classification models based on deep neural networks (DNNs) must be calibrated to measure the reliability of predictions. Some recent calibration methods have employed a probabilistic model on the probability simplex. However, these calibration methods cannot preserve the accuracy of pre-trained models, even those with a high classification accuracy. We propose an accuracy-preserving calibration method using the Concrete distribution as the probabilistic model on the probability simplex. We theoretically prove that a DNN model trained on cross-entropy loss has optimality as the parameter of the Concrete distribution. We also propose an efficient method that synthetically generates samples for training probabilistic models on the probability simplex. We demonstrate that the proposed method can outperform previous methods in accuracy-preserving calibration tasks using benchmarks. The code is available at https://github.com/ToyotaCRDL/SimplexTS.
title Accuracy-Preserving Calibration via Statistical Modeling on Probability Simplex
topic Machine Learning
url https://arxiv.org/abs/2402.13765