Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Gajjar, Aarshvi, Tai, Wai Ming, Xu, Xingyu, Hegde, Chinmay, Li, Yi, Musco, Christopher
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2405.09312
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910521092997120
author	Gajjar, Aarshvi Tai, Wai Ming Xu, Xingyu Hegde, Chinmay Li, Yi Musco, Christopher
author_facet	Gajjar, Aarshvi Tai, Wai Ming Xu, Xingyu Hegde, Chinmay Li, Yi Musco, Christopher
contents	We study active learning methods for single index models of the form $F({\mathbf x}) = f(\langle {\mathbf w}, {\mathbf x}\rangle)$, where $f:\mathbb{R} \to \mathbb{R}$ and ${\mathbf x,\mathbf w} \in \mathbb{R}^d$. In addition to their theoretical interest as simple examples of non-linear neural networks, single index models have received significant recent attention due to applications in scientific machine learning like surrogate modeling for partial differential equations (PDEs). Such applications require sample-efficient active learning methods that are robust to adversarial noise. I.e., that work even in the challenging agnostic learning setting. We provide two main results on agnostic active learning of single index models. First, when $f$ is known and Lipschitz, we show that $\tilde{O}(d)$ samples collected via {statistical leverage score sampling} are sufficient to learn a near-optimal single index model. Leverage score sampling is simple to implement, efficient, and already widely used for actively learning linear models. Our result requires no assumptions on the data distribution, is optimal up to log factors, and improves quadratically on a recent ${O}(d^{2})$ bound of \cite{gajjar2023active}. Second, we show that $\tilde{O}(d)$ samples suffice even in the more difficult setting when $f$ is \emph{unknown}. Our results leverage tools from high dimensional probability, including Dudley's inequality and dual Sudakov minoration, as well as a novel, distribution-aware discretization of the class of Lipschitz functions.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_09312
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Agnostic Active Learning of Single Index Models with Linear Sample Complexity Gajjar, Aarshvi Tai, Wai Ming Xu, Xingyu Hegde, Chinmay Li, Yi Musco, Christopher Machine Learning We study active learning methods for single index models of the form $F({\mathbf x}) = f(\langle {\mathbf w}, {\mathbf x}\rangle)$, where $f:\mathbb{R} \to \mathbb{R}$ and ${\mathbf x,\mathbf w} \in \mathbb{R}^d$. In addition to their theoretical interest as simple examples of non-linear neural networks, single index models have received significant recent attention due to applications in scientific machine learning like surrogate modeling for partial differential equations (PDEs). Such applications require sample-efficient active learning methods that are robust to adversarial noise. I.e., that work even in the challenging agnostic learning setting. We provide two main results on agnostic active learning of single index models. First, when $f$ is known and Lipschitz, we show that $\tilde{O}(d)$ samples collected via {statistical leverage score sampling} are sufficient to learn a near-optimal single index model. Leverage score sampling is simple to implement, efficient, and already widely used for actively learning linear models. Our result requires no assumptions on the data distribution, is optimal up to log factors, and improves quadratically on a recent ${O}(d^{2})$ bound of \cite{gajjar2023active}. Second, we show that $\tilde{O}(d)$ samples suffice even in the more difficult setting when $f$ is \emph{unknown}. Our results leverage tools from high dimensional probability, including Dudley's inequality and dual Sudakov minoration, as well as a novel, distribution-aware discretization of the class of Lipschitz functions.
title	Agnostic Active Learning of Single Index Models with Linear Sample Complexity
topic	Machine Learning
url	https://arxiv.org/abs/2405.09312

Similar Items