MARC erregistroa: :: Library Catalog

Gorde:

Xehetasun bibliografikoak
Egile Nagusiak:	Chugg, Ben, Wang, Hongjian, Ramdas, Aaditya
Formatua:	Preprint
Argitaratua:	2023
Gaiak:	Machine Learning Information Theory Statistics Theory
Sarrera elektronikoa:	https://arxiv.org/abs/2302.03421
Etiketak:	Etiketa erantsi Etiketarik gabe, Izan zaitez lehena erregistro honi etiketa jartzen!

_version_	1866913183599427584
author	Chugg, Ben Wang, Hongjian Ramdas, Aaditya
author_facet	Chugg, Ben Wang, Hongjian Ramdas, Aaditya
contents	We present a unified framework for deriving PAC-Bayesian generalization bounds. Unlike most previous literature on this topic, our bounds are anytime-valid (i.e., time-uniform), meaning that they hold at all stopping times, not only for a fixed sample size. Our approach combines four tools in the following order: (a) nonnegative supermartingales or reverse submartingales, (b) the method of mixtures, (c) the Donsker-Varadhan formula (or other convex duality principles), and (d) Ville's inequality. Our main result is a PAC-Bayes theorem which holds for a wide class of discrete stochastic processes. We show how this result implies time-uniform versions of well-known classical PAC-Bayes bounds, such as those of Seeger, McAllester, Maurer, and Catoni, in addition to many recent bounds. We also present several novel bounds. Our framework also enables us to relax traditional assumptions; in particular, we consider nonstationary loss functions and non-i.i.d. data. In sum, we unify the derivation of past bounds and ease the search for future bounds: one may simply check if our supermartingale or submartingale conditions are met and, if so, be guaranteed a (time-uniform) PAC-Bayes bound.
format	Preprint
id	arxiv_https___arxiv_org_abs_2302_03421
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	A unified recipe for deriving (time-uniform) PAC-Bayes bounds Chugg, Ben Wang, Hongjian Ramdas, Aaditya Machine Learning Information Theory Statistics Theory We present a unified framework for deriving PAC-Bayesian generalization bounds. Unlike most previous literature on this topic, our bounds are anytime-valid (i.e., time-uniform), meaning that they hold at all stopping times, not only for a fixed sample size. Our approach combines four tools in the following order: (a) nonnegative supermartingales or reverse submartingales, (b) the method of mixtures, (c) the Donsker-Varadhan formula (or other convex duality principles), and (d) Ville's inequality. Our main result is a PAC-Bayes theorem which holds for a wide class of discrete stochastic processes. We show how this result implies time-uniform versions of well-known classical PAC-Bayes bounds, such as those of Seeger, McAllester, Maurer, and Catoni, in addition to many recent bounds. We also present several novel bounds. Our framework also enables us to relax traditional assumptions; in particular, we consider nonstationary loss functions and non-i.i.d. data. In sum, we unify the derivation of past bounds and ease the search for future bounds: one may simply check if our supermartingale or submartingale conditions are met and, if so, be guaranteed a (time-uniform) PAC-Bayes bound.
title	A unified recipe for deriving (time-uniform) PAC-Bayes bounds
topic	Machine Learning Information Theory Statistics Theory
url	https://arxiv.org/abs/2302.03421

Antzeko izenburuak