Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ghazi, Badih, Guzmán, Cristóbal, Kamath, Pritish, Knop, Alexander, Kumar, Ravi, Manurangsi, Pasin, Sachdeva, Sushant
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2502.14809
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916623110111232
author	Ghazi, Badih Guzmán, Cristóbal Kamath, Pritish Knop, Alexander Kumar, Ravi Manurangsi, Pasin Sachdeva, Sushant
author_facet	Ghazi, Badih Guzmán, Cristóbal Kamath, Pritish Knop, Alexander Kumar, Ravi Manurangsi, Pasin Sachdeva, Sushant
contents	We introduce $\mathsf{PREM}$ (Private Relative Error Multiplicative weight update), a new framework for generating synthetic data that achieves a relative error guarantee for statistical queries under $(\varepsilon, δ)$ differential privacy (DP). Namely, for a domain ${\cal X}$, a family ${\cal F}$ of queries $f : {\cal X} \to \{0, 1\}$, and $ζ> 0$, our framework yields a mechanism that on input dataset $D \in {\cal X}^n$ outputs a synthetic dataset $\widehat{D} \in {\cal X}^n$ such that all statistical queries in ${\cal F}$ on $D$, namely $\sum_{x \in D} f(x)$ for $f \in {\cal F}$, are within a $1 \pm ζ$ multiplicative factor of the corresponding value on $\widehat{D}$ up to an additive error that is polynomial in $\log \|{\cal F}\|$, $\log \|{\cal X}\|$, $\log n$, $\log(1/δ)$, $1/\varepsilon$, and $1/ζ$. In contrast, any $(\varepsilon, δ)$-DP mechanism is known to require worst-case additive error that is polynomial in at least one of $n, \|{\cal F}\|$, or $\|{\cal X}\|$. We complement our algorithm with nearly matching lower bounds.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_14809
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	PREM: Privately Answering Statistical Queries with Relative Error Ghazi, Badih Guzmán, Cristóbal Kamath, Pritish Knop, Alexander Kumar, Ravi Manurangsi, Pasin Sachdeva, Sushant Machine Learning We introduce $\mathsf{PREM}$ (Private Relative Error Multiplicative weight update), a new framework for generating synthetic data that achieves a relative error guarantee for statistical queries under $(\varepsilon, δ)$ differential privacy (DP). Namely, for a domain ${\cal X}$, a family ${\cal F}$ of queries $f : {\cal X} \to \{0, 1\}$, and $ζ> 0$, our framework yields a mechanism that on input dataset $D \in {\cal X}^n$ outputs a synthetic dataset $\widehat{D} \in {\cal X}^n$ such that all statistical queries in ${\cal F}$ on $D$, namely $\sum_{x \in D} f(x)$ for $f \in {\cal F}$, are within a $1 \pm ζ$ multiplicative factor of the corresponding value on $\widehat{D}$ up to an additive error that is polynomial in $\log \|{\cal F}\|$, $\log \|{\cal X}\|$, $\log n$, $\log(1/δ)$, $1/\varepsilon$, and $1/ζ$. In contrast, any $(\varepsilon, δ)$-DP mechanism is known to require worst-case additive error that is polynomial in at least one of $n, \|{\cal F}\|$, or $\|{\cal X}\|$. We complement our algorithm with nearly matching lower bounds.
title	PREM: Privately Answering Statistical Queries with Relative Error
topic	Machine Learning
url	https://arxiv.org/abs/2502.14809

Similar Items