Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Block, Adam, Rakhlin, Alexander, Simchowitz, Max
Format:	Preprint
Published:	2023
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2302.05430
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914719152996352
author	Block, Adam Rakhlin, Alexander Simchowitz, Max
author_facet	Block, Adam Rakhlin, Alexander Simchowitz, Max
contents	Smoothed online learning has emerged as a popular framework to mitigate the substantial loss in statistical and computational complexity that arises when one moves from classical to adversarial learning. Unfortunately, for some spaces, it has been shown that efficient algorithms suffer an exponentially worse regret than that which is minimax optimal, even when the learner has access to an optimization oracle over the space. To mitigate that exponential dependence, this work introduces a new notion of complexity, the generalized bracketing numbers, which marries constraints on the adversary to the size of the space, and shows that an instantiation of Follow-the-Perturbed-Leader can attain low regret with the number of calls to the optimization oracle scaling optimally with respect to average regret. We then instantiate our bounds in several problems of interest, including online prediction and planning of piecewise continuous functions, which has many applications in fields as diverse as econometrics and robotics.
format	Preprint
id	arxiv_https___arxiv_org_abs_2302_05430
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Oracle-Efficient Smoothed Online Learning for Piecewise Continuous Decision Making Block, Adam Rakhlin, Alexander Simchowitz, Max Machine Learning Smoothed online learning has emerged as a popular framework to mitigate the substantial loss in statistical and computational complexity that arises when one moves from classical to adversarial learning. Unfortunately, for some spaces, it has been shown that efficient algorithms suffer an exponentially worse regret than that which is minimax optimal, even when the learner has access to an optimization oracle over the space. To mitigate that exponential dependence, this work introduces a new notion of complexity, the generalized bracketing numbers, which marries constraints on the adversary to the size of the space, and shows that an instantiation of Follow-the-Perturbed-Leader can attain low regret with the number of calls to the optimization oracle scaling optimally with respect to average regret. We then instantiate our bounds in several problems of interest, including online prediction and planning of piecewise continuous functions, which has many applications in fields as diverse as econometrics and robotics.
title	Oracle-Efficient Smoothed Online Learning for Piecewise Continuous Decision Making
topic	Machine Learning
url	https://arxiv.org/abs/2302.05430

Similar Items