Staff View: Distribution-Specific Agnostic Conditional Classification With Halfspaces :: Library Catalog

Bibliographic Details
Main Authors:	Huang, Jizhou; Juba, Brendan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Computational Complexity
Online Access:	https://arxiv.org/abs/2502.00172

_version_	1866915132504801280
author	Huang, Jizhou; Juba, Brendan
author_facet	Huang, Jizhou; Juba, Brendan
contents	We study "selective" or "conditional" classification problems in an agnostic setting. Classification tasks commonly focus on modeling the relationship between features and categories that captures the vast majority of data. In contrast to common machine learning frameworks, conditional classification aims to model such relationships only on a subset of the data defined by some selection rule. Most work on conditional classification either solves the problem in a realizable setting or does not guarantee that the error is bounded relative to an optimal solution. In this work, we consider selective/conditional classification by sparse linear classifiers for subsets defined by halfspaces, and give both positive and negative results for Gaussian feature distributions. On the positive side, we present the first PAC-learning algorithm for homogeneous halfspace selectors with error guarantee $\bigO*{\sqrt{\mathrm{opt}}}$, where $\mathrm{opt}$ is the smallest conditional classification error over the given class of classifiers and homogeneous halfspaces. On the negative side, we find that, under cryptographic assumptions, approximating the conditional classification loss within a small additive error is computationally hard even under Gaussian distributions. We prove that approximating conditional classification is at least as hard as approximating agnostic classification in both additive and multiplicative form.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_00172
institution	arXiv
publishDate	2025
record_format	arxiv
title	Distribution-Specific Agnostic Conditional Classification With Halfspaces
topic	Machine Learning; Computational Complexity
url	https://arxiv.org/abs/2502.00172
