Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Papagiannouli, Katerina, Trevisan, Dario, Zitto, Giuseppe Pio
Format:	Preprint
Publié:	2026
Sujets:	Machine Learning G.3
Accès en ligne:	https://arxiv.org/abs/2602.22925
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866914353121329152
author	Papagiannouli, Katerina Trevisan, Dario Zitto, Giuseppe Pio
author_facet	Papagiannouli, Katerina Trevisan, Dario Zitto, Giuseppe Pio
contents	We study wide Bayesian neural networks focusing on the rare but statistically dominant fluctuations that govern posterior concentration, beyond Gaussian-process limits. Large-deviation theory provides explicit variational objectives-rate functions-on predictors, providing an emerging notion of complexity and feature learning directly at the functional level. We show that the posterior output rate function is obtained by a joint optimization over predictors and internal kernels, in contrast with fixed-kernel (NNGP) theory. Numerical experiments demonstrate that the resulting predictions accurately describe finite-width behavior for moderately sized networks, capturing non-Gaussian tails, posterior deformation, and data-dependent kernel selection effects.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_22925
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Beyond NNGP: Large Deviations and Feature Learning in Bayesian Neural Networks Papagiannouli, Katerina Trevisan, Dario Zitto, Giuseppe Pio Machine Learning G.3 We study wide Bayesian neural networks focusing on the rare but statistically dominant fluctuations that govern posterior concentration, beyond Gaussian-process limits. Large-deviation theory provides explicit variational objectives-rate functions-on predictors, providing an emerging notion of complexity and feature learning directly at the functional level. We show that the posterior output rate function is obtained by a joint optimization over predictors and internal kernels, in contrast with fixed-kernel (NNGP) theory. Numerical experiments demonstrate that the resulting predictions accurately describe finite-width behavior for moderately sized networks, capturing non-Gaussian tails, posterior deformation, and data-dependent kernel selection effects.
title	Beyond NNGP: Large Deviations and Feature Learning in Bayesian Neural Networks
topic	Machine Learning G.3
url	https://arxiv.org/abs/2602.22925

Documents similaires