Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zein, Dina El, Kumar, Shashi, Henderson, James
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2603.09583
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912974164197376
author	Zein, Dina El Kumar, Shashi Henderson, James
author_facet	Zein, Dina El Kumar, Shashi Henderson, James
contents	The nonparametric variational information bottleneck (NVIB) provides the foundation for nonparametric variational differential privacy (NVDP), a framework for building privacy-preserving language models. However, the learned latent representations can drift into regions with high information content, leading to poor privacy guarantees, but also low utility due to numerical instability during training. In this work, we introduce a principled parameter clipping strategy to directly address this issue. Our method is mathematically derived from the objective of minimizing the Rényi Divergence (RD) upper bound, yielding specific, theoretically grounded constraints on the posterior mean, variance, and mixture weight parameters. We apply our technique to an NVIB based model and empirically compare it against an unconstrained baseline. Our findings demonstrate that the clipped model consistently achieves tighter RD bounds, implying stronger privacy, while simultaneously attaining higher performance on several downstream tasks. This work presents a simple yet effective method for improving the privacy-utility trade-off in variational models, making them more robust and practical.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_09583
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Nonparametric Variational Differential Privacy via Embedding Parameter Clipping Zein, Dina El Kumar, Shashi Henderson, James Machine Learning The nonparametric variational information bottleneck (NVIB) provides the foundation for nonparametric variational differential privacy (NVDP), a framework for building privacy-preserving language models. However, the learned latent representations can drift into regions with high information content, leading to poor privacy guarantees, but also low utility due to numerical instability during training. In this work, we introduce a principled parameter clipping strategy to directly address this issue. Our method is mathematically derived from the objective of minimizing the Rényi Divergence (RD) upper bound, yielding specific, theoretically grounded constraints on the posterior mean, variance, and mixture weight parameters. We apply our technique to an NVIB based model and empirically compare it against an unconstrained baseline. Our findings demonstrate that the clipped model consistently achieves tighter RD bounds, implying stronger privacy, while simultaneously attaining higher performance on several downstream tasks. This work presents a simple yet effective method for improving the privacy-utility trade-off in variational models, making them more robust and practical.
title	Nonparametric Variational Differential Privacy via Embedding Parameter Clipping
topic	Machine Learning
url	https://arxiv.org/abs/2603.09583

Similar Items