MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Baby, Dheeraj, Han, Boran, Zhang, Shuai, Hu, Cuixiong, Wang, Yuyang, Wang, Yu-Xiang
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Machine Learning
Accesso online:	https://arxiv.org/abs/2504.07261
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866917981832871936
author	Baby, Dheeraj Han, Boran Zhang, Shuai Hu, Cuixiong Wang, Yuyang Wang, Yu-Xiang
author_facet	Baby, Dheeraj Han, Boran Zhang, Shuai Hu, Cuixiong Wang, Yuyang Wang, Yu-Xiang
contents	We study the well-motivated problem of online distribution shift in which the data arrive in batches and the distribution of each batch can change arbitrarily over time. Since the shifts can be large or small, abrupt or gradual, the length of the relevant historical data to learn from may vary over time, which poses a major challenge in designing algorithms that can automatically adapt to the best ``attention span'' while remaining computationally efficient. We propose a meta-algorithm that takes any network architecture and any Online Learner (OL) algorithm as input and produces a new algorithm which provably enhances the performance of the given OL under non-stationarity. Our algorithm is efficient (it requires maintaining only $O(\log(T))$ OL instances) and adaptive (it automatically chooses OL instances with the ideal ``attention'' length at every timestamp). Experiments on various real-world datasets across text and image modalities show that our method consistently improves the accuracy of user specified OL algorithms for classification tasks. Key novel algorithmic ingredients include a \emph{multi-resolution instance} design inspired by wavelet theory and a cross-validation-through-time technique. Both could be of independent interest.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_07261
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Adapting to Online Distribution Shifts in Deep Learning: A Black-Box Approach Baby, Dheeraj Han, Boran Zhang, Shuai Hu, Cuixiong Wang, Yuyang Wang, Yu-Xiang Machine Learning We study the well-motivated problem of online distribution shift in which the data arrive in batches and the distribution of each batch can change arbitrarily over time. Since the shifts can be large or small, abrupt or gradual, the length of the relevant historical data to learn from may vary over time, which poses a major challenge in designing algorithms that can automatically adapt to the best ``attention span'' while remaining computationally efficient. We propose a meta-algorithm that takes any network architecture and any Online Learner (OL) algorithm as input and produces a new algorithm which provably enhances the performance of the given OL under non-stationarity. Our algorithm is efficient (it requires maintaining only $O(\log(T))$ OL instances) and adaptive (it automatically chooses OL instances with the ideal ``attention'' length at every timestamp). Experiments on various real-world datasets across text and image modalities show that our method consistently improves the accuracy of user specified OL algorithms for classification tasks. Key novel algorithmic ingredients include a \emph{multi-resolution instance} design inspired by wavelet theory and a cross-validation-through-time technique. Both could be of independent interest.
title	Adapting to Online Distribution Shifts in Deep Learning: A Black-Box Approach
topic	Machine Learning
url	https://arxiv.org/abs/2504.07261

Documenti analoghi