Saved in:
Bibliographic Details
Main Authors: Dann, Christoph, Mansour, Yishay, Mohri, Mehryar, Schneider, Jon, Sivan, Balasubramanian
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.07585
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909258790993920
author Dann, Christoph
Mansour, Yishay
Mohri, Mehryar
Schneider, Jon
Sivan, Balasubramanian
author_facet Dann, Christoph
Mansour, Yishay
Mohri, Mehryar
Schneider, Jon
Sivan, Balasubramanian
contents Abernethy et al. (2011) showed that Blackwell approachability and no-regret learning are equivalent, in the sense that any algorithm that solves a specific Blackwell approachability instance can be converted to a sublinear regret algorithm for a specific no-regret learning instance, and vice versa. In this paper, we study a more fine-grained form of such reductions, and ask when this translation between problems preserves not only a sublinear rate of convergence, but also preserves the optimal rate of convergence. That is, in which cases does it suffice to find the optimal regret bound for a no-regret learning instance in order to find the optimal rate of convergence for a corresponding approachability instance? We show that the reduction of Abernethy et al. (2011) does not preserve rates: their reduction may reduce a $d$-dimensional approachability instance $I_1$ with optimal convergence rate $R_1$ to a no-regret learning instance $I_2$ with optimal regret-per-round of $R_2$, with $R_{2}/R_{1}$ arbitrarily large (in particular, it is possible that $R_1 = 0$ and $R_{2} > 0$). On the other hand, we show that it is possible to tightly reduce any approachability instance to an instance of a generalized form of regret minimization we call improper $ϕ$-regret minimization (a variant of the $ϕ$-regret minimization of Gordon et al. (2008) where the transformation functions may map actions outside of the action set). Finally, we characterize when linear transformations suffice to reduce improper $ϕ$-regret minimization problems to standard classes of regret minimization problems in a rate preserving manner. We prove that some improper $ϕ$-regret minimization instances cannot be reduced to either subclass of instance in this way, suggesting that approachability can capture some problems that cannot be phrased in the language of online learning.
format Preprint
id arxiv_https___arxiv_org_abs_2406_07585
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Rate-Preserving Reductions for Blackwell Approachability
Dann, Christoph
Mansour, Yishay
Mohri, Mehryar
Schneider, Jon
Sivan, Balasubramanian
Machine Learning
Abernethy et al. (2011) showed that Blackwell approachability and no-regret learning are equivalent, in the sense that any algorithm that solves a specific Blackwell approachability instance can be converted to a sublinear regret algorithm for a specific no-regret learning instance, and vice versa. In this paper, we study a more fine-grained form of such reductions, and ask when this translation between problems preserves not only a sublinear rate of convergence, but also preserves the optimal rate of convergence. That is, in which cases does it suffice to find the optimal regret bound for a no-regret learning instance in order to find the optimal rate of convergence for a corresponding approachability instance? We show that the reduction of Abernethy et al. (2011) does not preserve rates: their reduction may reduce a $d$-dimensional approachability instance $I_1$ with optimal convergence rate $R_1$ to a no-regret learning instance $I_2$ with optimal regret-per-round of $R_2$, with $R_{2}/R_{1}$ arbitrarily large (in particular, it is possible that $R_1 = 0$ and $R_{2} > 0$). On the other hand, we show that it is possible to tightly reduce any approachability instance to an instance of a generalized form of regret minimization we call improper $ϕ$-regret minimization (a variant of the $ϕ$-regret minimization of Gordon et al. (2008) where the transformation functions may map actions outside of the action set). Finally, we characterize when linear transformations suffice to reduce improper $ϕ$-regret minimization problems to standard classes of regret minimization problems in a rate preserving manner. We prove that some improper $ϕ$-regret minimization instances cannot be reduced to either subclass of instance in this way, suggesting that approachability can capture some problems that cannot be phrased in the language of online learning.
title Rate-Preserving Reductions for Blackwell Approachability
topic Machine Learning
url https://arxiv.org/abs/2406.07585