Saved in:
Bibliographic Details
Main Authors: Jung, Ádám, Kelen, Domokos M., Benczúr, András A.
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.13362
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917274420510720
author Jung, Ádám
Kelen, Domokos M.
Benczúr, András A.
author_facet Jung, Ádám
Kelen, Domokos M.
Benczúr, András A.
contents A key challenge in probabilistic regression is ensuring that predictive distributions accurately reflect true empirical uncertainty. Minimizing overall prediction error often encourages models to prioritize informativeness over calibration, producing narrow but overconfident predictions. However, in safety-critical settings, trustworthy uncertainty estimates are often more valuable than narrow intervals. Realizing the problem, several recent works have focused on post-hoc corrections; however, existing methods either rely on weak notions of calibration (such as PIT uniformity) or impose restrictive parametric assumptions on the nature of the error. To address these limitations, we propose a novel nonparametric re-calibration algorithm based on conditional kernel mean embeddings, capable of correcting calibration error without restrictive modeling assumptions. For efficient inference with real-valued targets, we introduce a novel characteristic kernel over distributions that can be evaluated in $\mathcal{O}(n \log n)$ time for empirical distributions of size $n$. We demonstrate that our method consistently outperforms prior re-calibration approaches across a diverse set of regression benchmarks and model classes.
format Preprint
id arxiv_https___arxiv_org_abs_2602_13362
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Nonparametric Distribution Regression Re-calibration
Jung, Ádám
Kelen, Domokos M.
Benczúr, András A.
Machine Learning
Artificial Intelligence
A key challenge in probabilistic regression is ensuring that predictive distributions accurately reflect true empirical uncertainty. Minimizing overall prediction error often encourages models to prioritize informativeness over calibration, producing narrow but overconfident predictions. However, in safety-critical settings, trustworthy uncertainty estimates are often more valuable than narrow intervals. Realizing the problem, several recent works have focused on post-hoc corrections; however, existing methods either rely on weak notions of calibration (such as PIT uniformity) or impose restrictive parametric assumptions on the nature of the error. To address these limitations, we propose a novel nonparametric re-calibration algorithm based on conditional kernel mean embeddings, capable of correcting calibration error without restrictive modeling assumptions. For efficient inference with real-valued targets, we introduce a novel characteristic kernel over distributions that can be evaluated in $\mathcal{O}(n \log n)$ time for empirical distributions of size $n$. We demonstrate that our method consistently outperforms prior re-calibration approaches across a diverse set of regression benchmarks and model classes.
title Nonparametric Distribution Regression Re-calibration
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2602.13362