Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:On Privacy Protection of Latent Dirichlet Allocation Model Training

Jun 04, 2019

Fangyuan Zhao, Xuebin Ren, Shusen Yang, Xinyu Yang

Figure 1 for On Privacy Protection of Latent Dirichlet Allocation Model Training

Figure 2 for On Privacy Protection of Latent Dirichlet Allocation Model Training

Figure 3 for On Privacy Protection of Latent Dirichlet Allocation Model Training

Share this with someone who'll enjoy it:

Abstract:Latent Dirichlet Allocation (LDA) is a popular topic modeling technique for discovery of hidden semantic architecture of text datasets, and plays a fundamental role in many machine learning applications. However, like many other machine learning algorithms, the process of training a LDA model may leak the sensitive information of the training datasets and bring significant privacy risks. To mitigate the privacy issues in LDA, we focus on studying privacy-preserving algorithms of LDA model training in this paper. In particular, we first develop a privacy monitoring algorithm to investigate the privacy guarantee obtained from the inherent randomness of the Collapsed Gibbs Sampling (CGS) process in a typical LDA training algorithm on centralized curated datasets. Then, we further propose a locally private LDA training algorithm on crowdsourced data to provide local differential privacy for individual data contributors. The experimental results on real-world datasets demonstrate the effectiveness of our proposed algorithms.

* 8 pages,5 figures,and is published in International Joint Conferences on Artificial Intelligence

View paper on

Share this with someone who'll enjoy it:

Title:On Privacy Protection of Latent Dirichlet Allocation Model Training

Paper and Code