Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Cheng, Zhuoyu, Hatano, Kohei, Takimoto, Eiji
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.20734
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912803461267456
author	Cheng, Zhuoyu Hatano, Kohei Takimoto, Eiji
author_facet	Cheng, Zhuoyu Hatano, Kohei Takimoto, Eiji
contents	We consider a bandit optimization problem for nonconvex and non-smooth functions, where in each trial the loss function is the sum of a linear function and a small but arbitrary perturbation chosen after observing the player's choice. We give both expected and high probability regret bounds for the problem. Our result also implies an improved high-probability regret bound for the bandit linear optimization, a special case with no perturbation. We also give a lower bound on the expected regret.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_20734
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Adversarial bandit optimization for approximately linear functions Cheng, Zhuoyu Hatano, Kohei Takimoto, Eiji Machine Learning Artificial Intelligence We consider a bandit optimization problem for nonconvex and non-smooth functions, where in each trial the loss function is the sum of a linear function and a small but arbitrary perturbation chosen after observing the player's choice. We give both expected and high probability regret bounds for the problem. Our result also implies an improved high-probability regret bound for the bandit linear optimization, a special case with no perturbation. We also give a lower bound on the expected regret.
title	Adversarial bandit optimization for approximately linear functions
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2505.20734

Similar Items