Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Erchi, Zhu, Yuqing, Wang, Yu-Xiang
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2505.24737
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915313758502912
author	Wang, Erchi Zhu, Yuqing Wang, Yu-Xiang
author_facet	Wang, Erchi Zhu, Yuqing Wang, Yu-Xiang
contents	This paper studies the problem of differentially private empirical risk minimization (DP-ERM) for binary linear classification. We obtain an efficient $(\varepsilon,δ)$-DP algorithm with an empirical zero-one risk bound of $\tilde{O}\left(\frac{1}{γ^2\varepsilon n} + \frac{\|S_{\mathrm{out}}\|}{γn}\right)$ where $n$ is the number of data points, $S_{\mathrm{out}}$ is an arbitrary subset of data one can remove and $γ$ is the margin of linear separation of the remaining data points (after $S_{\mathrm{out}}$ is removed). Here, $\tilde{O}(\cdot)$ hides only logarithmic terms. In the agnostic case, we improve the existing results when the number of outliers is small. Our algorithm is highly adaptive because it does not require knowing the margin parameter $γ$ or outlier subset $S_{\mathrm{out}}$. We also derive a utility bound for the advanced private hyperparameter tuning algorithm.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_24737
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Adapting to Linear Separable Subsets with Large-Margin in Differentially Private Learning Wang, Erchi Zhu, Yuqing Wang, Yu-Xiang Machine Learning This paper studies the problem of differentially private empirical risk minimization (DP-ERM) for binary linear classification. We obtain an efficient $(\varepsilon,δ)$-DP algorithm with an empirical zero-one risk bound of $\tilde{O}\left(\frac{1}{γ^2\varepsilon n} + \frac{\|S_{\mathrm{out}}\|}{γn}\right)$ where $n$ is the number of data points, $S_{\mathrm{out}}$ is an arbitrary subset of data one can remove and $γ$ is the margin of linear separation of the remaining data points (after $S_{\mathrm{out}}$ is removed). Here, $\tilde{O}(\cdot)$ hides only logarithmic terms. In the agnostic case, we improve the existing results when the number of outliers is small. Our algorithm is highly adaptive because it does not require knowing the margin parameter $γ$ or outlier subset $S_{\mathrm{out}}$. We also derive a utility bound for the advanced private hyperparameter tuning algorithm.
title	Adapting to Linear Separable Subsets with Large-Margin in Differentially Private Learning
topic	Machine Learning
url	https://arxiv.org/abs/2505.24737

Similar Items