Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aaron Johnson

Differentially Private Maximal Information Coefficients

Jun 21, 2022

John Lazarsfeld, Aaron Johnson, Emmanuel Adeniran

Figure 1 for Differentially Private Maximal Information Coefficients

Figure 2 for Differentially Private Maximal Information Coefficients

Figure 3 for Differentially Private Maximal Information Coefficients

Figure 4 for Differentially Private Maximal Information Coefficients

Abstract:The Maximal Information Coefficient (MIC) is a powerful statistic to identify dependencies between variables. However, it may be applied to sensitive data, and publishing it could leak private information. As a solution, we present algorithms to approximate MIC in a way that provides differential privacy. We show that the natural application of the classic Laplace mechanism yields insufficient accuracy. We therefore introduce the MICr statistic, which is a new MIC approximation that is more compatible with differential privacy. We prove MICr is a consistent estimator for MIC, and we provide two differentially private versions of it. We perform experiments on a variety of real and synthetic datasets. The results show that the private MICr statistics significantly outperform direct application of the Laplace mechanism. Moreover, experiments on real-world datasets show accuracy that is usable when the sample size is at least moderately large.

* 38 pages, to appear in ICML 2022

Via

Access Paper or Ask Questions

Consistency of the Maximal Information Coefficient Estimator

Jul 08, 2021

John Lazarsfeld, Aaron Johnson

Abstract:The Maximal Information Coefficient (MIC) of Reshef et al. (Science, 2011) is a statistic for measuring dependence between variable pairs in large datasets. In this note, we prove that MIC is a consistent estimator of the corresponding population statistic MIC$_*$. This corrects an error in an argument of Reshef et al. (JMLR, 2016), which we describe.

Via

Access Paper or Ask Questions