Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Dann, Christoph, Mansour, Yishay, Mohri, Mehryar, Schneider, Jon, Sivan, Balasubramanian
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2406.07585
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909258790993920
author	Dann, Christoph Mansour, Yishay Mohri, Mehryar Schneider, Jon Sivan, Balasubramanian
author_facet	Dann, Christoph Mansour, Yishay Mohri, Mehryar Schneider, Jon Sivan, Balasubramanian
contents	Abernethy et al. (2011) showed that Blackwell approachability and no-regret learning are equivalent, in the sense that any algorithm that solves a specific Blackwell approachability instance can be converted to a sublinear regret algorithm for a specific no-regret learning instance, and vice versa. In this paper, we study a more fine-grained form of such reductions, and ask when this translation between problems preserves not only a sublinear rate of convergence, but also preserves the optimal rate of convergence. That is, in which cases does it suffice to find the optimal regret bound for a no-regret learning instance in order to find the optimal rate of convergence for a corresponding approachability instance? We show that the reduction of Abernethy et al. (2011) does not preserve rates: their reduction may reduce a $d$-dimensional approachability instance $I_1$ with optimal convergence rate $R_1$ to a no-regret learning instance $I_2$ with optimal regret-per-round of $R_2$, with $R_{2}/R_{1}$ arbitrarily large (in particular, it is possible that $R_1 = 0$ and $R_{2} > 0$). On the other hand, we show that it is possible to tightly reduce any approachability instance to an instance of a generalized form of regret minimization we call improper $ϕ$-regret minimization (a variant of the $ϕ$-regret minimization of Gordon et al. (2008) where the transformation functions may map actions outside of the action set). Finally, we characterize when linear transformations suffice to reduce improper $ϕ$-regret minimization problems to standard classes of regret minimization problems in a rate preserving manner. We prove that some improper $ϕ$-regret minimization instances cannot be reduced to either subclass of instance in this way, suggesting that approachability can capture some problems that cannot be phrased in the language of online learning.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_07585
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Rate-Preserving Reductions for Blackwell Approachability Dann, Christoph Mansour, Yishay Mohri, Mehryar Schneider, Jon Sivan, Balasubramanian Machine Learning Abernethy et al. (2011) showed that Blackwell approachability and no-regret learning are equivalent, in the sense that any algorithm that solves a specific Blackwell approachability instance can be converted to a sublinear regret algorithm for a specific no-regret learning instance, and vice versa. In this paper, we study a more fine-grained form of such reductions, and ask when this translation between problems preserves not only a sublinear rate of convergence, but also preserves the optimal rate of convergence. That is, in which cases does it suffice to find the optimal regret bound for a no-regret learning instance in order to find the optimal rate of convergence for a corresponding approachability instance? We show that the reduction of Abernethy et al. (2011) does not preserve rates: their reduction may reduce a $d$-dimensional approachability instance $I_1$ with optimal convergence rate $R_1$ to a no-regret learning instance $I_2$ with optimal regret-per-round of $R_2$, with $R_{2}/R_{1}$ arbitrarily large (in particular, it is possible that $R_1 = 0$ and $R_{2} > 0$). On the other hand, we show that it is possible to tightly reduce any approachability instance to an instance of a generalized form of regret minimization we call improper $ϕ$-regret minimization (a variant of the $ϕ$-regret minimization of Gordon et al. (2008) where the transformation functions may map actions outside of the action set). Finally, we characterize when linear transformations suffice to reduce improper $ϕ$-regret minimization problems to standard classes of regret minimization problems in a rate preserving manner. We prove that some improper $ϕ$-regret minimization instances cannot be reduced to either subclass of instance in this way, suggesting that approachability can capture some problems that cannot be phrased in the language of online learning.
title	Rate-Preserving Reductions for Blackwell Approachability
topic	Machine Learning
url	https://arxiv.org/abs/2406.07585

Similar Items