Abstract: Synthetic data inherits the differential privacy guarantees of the model used to generate it. Additionally, synthetic data may benefit from privacy amplification when the generative model is kept hidden. While empirical studies suggest that this phenomenon occurs, a rigorous theoretical understanding is still lacking. In this paper, we investigate this question through the well-understood framework of linear regression. First, we establish negative results showing that, if an adversary controls the seed of the generative model, a single synthetic data point can leak as much information as releasing the model itself. Conversely, we show that when synthetic data is generated from random inputs, releasing a limited number of synthetic data points amplifies privacy beyond the model's inherent guarantees. We believe our findings in linear regression can serve as a foundation for deriving more general bounds in the future.
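To make the two settings in this abstract concrete, here is a minimal sketch (our illustration, not code from the paper): a linear regression model is privatized by output perturbation with a placeholder noise scale sigma, and synthetic points are then generated from random inputs, the regime in which the abstract's amplification claim applies.

import numpy as np

# Illustrative sketch, not from the paper: release a DP linear regression
# model via output perturbation, then generate synthetic points from
# random inputs (the abstract's second, privacy-amplifying setting).
rng = np.random.default_rng(0)

# Private training data.
n, d = 1000, 5
X = rng.normal(size=(n, d))
y = X @ rng.normal(size=d) + 0.1 * rng.normal(size=n)

# Least-squares fit, then Gaussian noise on the parameters. The scale
# `sigma` is a placeholder; calibrating it to a target (epsilon, delta)
# requires a sensitivity analysis that is out of scope here.
theta_hat = np.linalg.lstsq(X, y, rcond=None)[0]
sigma = 0.5
theta_priv = theta_hat + sigma * rng.normal(size=d)

# Synthetic data from *random* inputs: each released pair (x, y_syn)
# exposes only a one-dimensional projection of theta_priv, which is the
# intuition behind amplification when few points are released. An
# adversary-chosen X_syn, by contrast, can reconstruct theta_priv exactly
# from d points, matching the abstract's negative result.
m = 10
X_syn = rng.normal(size=(m, d))
y_syn = X_syn @ theta_priv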
Abstract: Pufferfish privacy is a flexible generalization of differential privacy that makes it possible to model arbitrary secrets and the adversary's prior knowledge about the data. Unfortunately, designing general and tractable Pufferfish mechanisms that do not compromise utility is challenging. Furthermore, this framework does not provide the composition guarantees needed for direct use in iterative machine learning algorithms. To mitigate these issues, we introduce a Rényi divergence-based variant of Pufferfish and show that it allows us to extend the applicability of the Pufferfish framework. We first generalize the Wasserstein mechanism to cover a wide range of noise distributions and introduce several ways to improve its utility. We also derive stronger guarantees against out-of-distribution adversaries. Finally, as an alternative to composition, we prove privacy amplification results for contractive noisy iterations and showcase the first use of Pufferfish in private convex optimization. A common ingredient underlying our results is the use and extension of shift reduction lemmas.
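For intuition, a natural way to formalize such a Rényi variant (our phrasing under assumption, not a definition quoted from the abstract) is to replace the max-divergence bound of standard Pufferfish with a bound on the Rényi divergence of order \alpha between the mechanism's output distributions conditioned on a pair of secrets:

\[
D_\alpha\big( \Pr(\mathcal{M}(X) \mid s_i, \theta) \,\big\|\, \Pr(\mathcal{M}(X) \mid s_j, \theta) \big) \le \varepsilon
\quad \text{for all } \theta \in \Theta,\ (s_i, s_j) \in \mathcal{Q},
\]

where \Theta is the set of priors, \mathcal{Q} is the set of secret pairs to be protected, and D_\alpha(P \| Q) = \frac{1}{\alpha - 1} \log \mathbb{E}_{x \sim Q}\!\left[ \big( P(x)/Q(x) \big)^{\alpha} \right] is the Rényi divergence; this mirrors how Rényi DP relaxes the max-divergence bound of differential privacy.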