Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wei, Yadi, Khardon, Roni
Format:	Preprint
Published:	2023
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2302.02420
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913572180721664
author	Wei, Yadi Khardon, Roni
author_facet	Wei, Yadi Khardon, Roni
contents	Traditional neural networks are simple to train but they typically produce overconfident predictions. In contrast, Bayesian neural networks provide good uncertainty quantification but optimizing them is time consuming due to the large parameter space. This paper proposes to combine the advantages of both approaches by performing Variational Inference in the Final layer Output space (VIFO), because the output space is much smaller than the parameter space. We use neural networks to learn the mean and the variance of the probabilistic output. Using the Bayesian formulation we incorporate collapsed variational inference with VIFO which significantly improves the performance in practice. On the other hand, like standard, non-Bayesian models, VIFO enjoys simple training and one can use Rademacher complexity to provide risk bounds for the model. Experiments show that VIFO provides a good tradeoff in terms of run time and uncertainty quantification, especially for out of distribution data.
format	Preprint
id	arxiv_https___arxiv_org_abs_2302_02420
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Variational Inference on the Final-Layer Output of Neural Networks Wei, Yadi Khardon, Roni Machine Learning Traditional neural networks are simple to train but they typically produce overconfident predictions. In contrast, Bayesian neural networks provide good uncertainty quantification but optimizing them is time consuming due to the large parameter space. This paper proposes to combine the advantages of both approaches by performing Variational Inference in the Final layer Output space (VIFO), because the output space is much smaller than the parameter space. We use neural networks to learn the mean and the variance of the probabilistic output. Using the Bayesian formulation we incorporate collapsed variational inference with VIFO which significantly improves the performance in practice. On the other hand, like standard, non-Bayesian models, VIFO enjoys simple training and one can use Rademacher complexity to provide risk bounds for the model. Experiments show that VIFO provides a good tradeoff in terms of run time and uncertainty quantification, especially for out of distribution data.
title	Variational Inference on the Final-Layer Output of Neural Networks
topic	Machine Learning
url	https://arxiv.org/abs/2302.02420

Similar Items