Guardado en:
Detalles Bibliográficos
Autores principales: Caprio, Michele, Stutz, David, Li, Shuo, Doucet, Arnaud
Formato: Preprint
Publicado: 2024
Materias:
Acceso en línea:https://arxiv.org/abs/2411.04852
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866916586226450432
author Caprio, Michele
Stutz, David
Li, Shuo
Doucet, Arnaud
author_facet Caprio, Michele
Stutz, David
Li, Shuo
Doucet, Arnaud
contents An open question in \emph{Imprecise Probabilistic Machine Learning} is how to empirically derive a credal region (i.e., a closed and convex family of probabilities on the output space) from the available data, without any prior knowledge or assumption. In classification problems, credal regions are a tool that is able to provide provable guarantees under realistic assumptions by characterizing the uncertainty about the distribution of the labels. Building on previous work, we show that credal regions can be directly constructed using conformal methods. This allows us to provide a novel extension of classical conformal prediction to problems with ambiguous ground truth, that is, when the exact labels for given inputs are not exactly known. The resulting construction enjoys desirable practical and theoretical properties: (i) conformal coverage guarantees, (ii) smaller prediction sets (compared to classical conformal prediction regions) and (iii) disentanglement of uncertainty sources (epistemic, aleatoric). We empirically verify our findings on both synthetic and real datasets.
format Preprint
id arxiv_https___arxiv_org_abs_2411_04852
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Conformalized Credal Regions for Classification with Ambiguous Ground Truth
Caprio, Michele
Stutz, David
Li, Shuo
Doucet, Arnaud
Machine Learning
An open question in \emph{Imprecise Probabilistic Machine Learning} is how to empirically derive a credal region (i.e., a closed and convex family of probabilities on the output space) from the available data, without any prior knowledge or assumption. In classification problems, credal regions are a tool that is able to provide provable guarantees under realistic assumptions by characterizing the uncertainty about the distribution of the labels. Building on previous work, we show that credal regions can be directly constructed using conformal methods. This allows us to provide a novel extension of classical conformal prediction to problems with ambiguous ground truth, that is, when the exact labels for given inputs are not exactly known. The resulting construction enjoys desirable practical and theoretical properties: (i) conformal coverage guarantees, (ii) smaller prediction sets (compared to classical conformal prediction regions) and (iii) disentanglement of uncertainty sources (epistemic, aleatoric). We empirically verify our findings on both synthetic and real datasets.
title Conformalized Credal Regions for Classification with Ambiguous Ground Truth
topic Machine Learning
url https://arxiv.org/abs/2411.04852