Saved in:
Bibliographic Details
Main Authors: Roy, Saptarshi, Wang, Zehua, Tewari, Ambuj
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2310.07852
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • We consider the problem of model selection in a high-dimensional sparse linear regression model under privacy constraints. We propose a differentially private (DP) best subset selection method with strong statistical utility properties by adopting the well-known exponential mechanism for selecting the best model. To achieve computational expediency, we propose an efficient Metropolis-Hastings algorithm and under certain regularity conditions, we establish that it enjoys polynomial mixing time to its stationary distribution. As a result, we also establish both approximate differential privacy and statistical utility for the estimates of the mixed Metropolis-Hastings chain. Finally, we perform some illustrative experiments on simulated data showing that our algorithm can quickly identify active features under reasonable privacy budget constraints.