Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Daunas, Francisco, Esnaola, Iñaki, Perlaza, Samir M., Poor, H. Vincent
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Information Theory
Online Access:	https://arxiv.org/abs/2402.00501
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909361102651392
author	Daunas, Francisco Esnaola, Iñaki Perlaza, Samir M. Poor, H. Vincent
author_facet	Daunas, Francisco Esnaola, Iñaki Perlaza, Samir M. Poor, H. Vincent
contents	The solution to empirical risk minimization with $f$-divergence regularization (ERM-$f$DR) is presented under mild conditions on $f$. Under such conditions, the optimal measure is shown to be unique. Examples of the solution for particular choices of the function $f$ are presented. Previously known solutions to common regularization choices are obtained by leveraging the flexibility of the family of $f$-divergences. These include the unique solutions to empirical risk minimization with relative entropy regularization (Type-I and Type-II). The analysis of the solution unveils the following properties of $f$-divergences when used in the ERM-$f$DR problem: $i\bigl)$ $f$-divergence regularization forces the support of the solution to coincide with the support of the reference measure, which introduces a strong inductive bias that dominates the evidence provided by the training data; and $ii\bigl)$ any $f$-divergence regularization is equivalent to a different $f$-divergence regularization with an appropriate transformation of the empirical risk function.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_00501
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Equivalence of the Empirical Risk Minimization to Regularization on the Family of f-Divergences Daunas, Francisco Esnaola, Iñaki Perlaza, Samir M. Poor, H. Vincent Machine Learning Information Theory The solution to empirical risk minimization with $f$-divergence regularization (ERM-$f$DR) is presented under mild conditions on $f$. Under such conditions, the optimal measure is shown to be unique. Examples of the solution for particular choices of the function $f$ are presented. Previously known solutions to common regularization choices are obtained by leveraging the flexibility of the family of $f$-divergences. These include the unique solutions to empirical risk minimization with relative entropy regularization (Type-I and Type-II). The analysis of the solution unveils the following properties of $f$-divergences when used in the ERM-$f$DR problem: $i\bigl)$ $f$-divergence regularization forces the support of the solution to coincide with the support of the reference measure, which introduces a strong inductive bias that dominates the evidence provided by the training data; and $ii\bigl)$ any $f$-divergence regularization is equivalent to a different $f$-divergence regularization with an appropriate transformation of the empirical risk function.
title	Equivalence of the Empirical Risk Minimization to Regularization on the Family of f-Divergences
topic	Machine Learning Information Theory
url	https://arxiv.org/abs/2402.00501

Similar Items