Saved in:
Bibliographic Details
Main Authors: Hu, Shenggang, Aslett, Louis, Dai, Hongsheng, Pollock, Murray, Roberts, Gareth O.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.07772
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911441470095360
author Hu, Shenggang
Aslett, Louis
Dai, Hongsheng
Pollock, Murray
Roberts, Gareth O.
author_facet Hu, Shenggang
Aslett, Louis
Dai, Hongsheng
Pollock, Murray
Roberts, Gareth O.
contents In recent years, differential privacy has been adopted by tech-companies and governmental agencies as the standard for measuring privacy in algorithms. In this article, we study differential privacy in Bayesian posterior sampling settings. We begin by considering differential privacy in the most common privatisation setting in which Laplace or Gaussian noise is injected into the output. In an effort to achieve better differential privacy, we consider adopting {\em Huber's contamination model} for use within privacy settings, and replace at random data points with samples from a heavy-tailed distribution ({\em instead} of injecting noise into the output). We derive bounds for the differential privacy level $(ε,δ)$ of our approach, without requiring bounded observation and parameter spaces, a restriction commonly imposed in the literature. We further consider for our approach the effect of sample size on the privacy level and the rate at which $(ε,δ)$ converges to zero. Asymptotically, our contamination approach is fully private with no information loss. We also provide examples of inference models for which our approach applies, with theoretical convergence rate analysis and simulation studies.
format Preprint
id arxiv_https___arxiv_org_abs_2403_07772
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Privacy Guarantees in Posterior Sampling under Contamination
Hu, Shenggang
Aslett, Louis
Dai, Hongsheng
Pollock, Murray
Roberts, Gareth O.
Statistics Theory
62F15, 62J12
In recent years, differential privacy has been adopted by tech-companies and governmental agencies as the standard for measuring privacy in algorithms. In this article, we study differential privacy in Bayesian posterior sampling settings. We begin by considering differential privacy in the most common privatisation setting in which Laplace or Gaussian noise is injected into the output. In an effort to achieve better differential privacy, we consider adopting {\em Huber's contamination model} for use within privacy settings, and replace at random data points with samples from a heavy-tailed distribution ({\em instead} of injecting noise into the output). We derive bounds for the differential privacy level $(ε,δ)$ of our approach, without requiring bounded observation and parameter spaces, a restriction commonly imposed in the literature. We further consider for our approach the effect of sample size on the privacy level and the rate at which $(ε,δ)$ converges to zero. Asymptotically, our contamination approach is fully private with no information loss. We also provide examples of inference models for which our approach applies, with theoretical convergence rate analysis and simulation studies.
title Privacy Guarantees in Posterior Sampling under Contamination
topic Statistics Theory
62F15, 62J12
url https://arxiv.org/abs/2403.07772