Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lihua Yin

Private and Communication-Efficient Federated Learning based on Differentially Private Sketches

Oct 10, 2024

Meifan Zhang, Zhanhong Xie, Lihua Yin

Figure 1 for Private and Communication-Efficient Federated Learning based on Differentially Private Sketches

Figure 2 for Private and Communication-Efficient Federated Learning based on Differentially Private Sketches

Figure 3 for Private and Communication-Efficient Federated Learning based on Differentially Private Sketches

Figure 4 for Private and Communication-Efficient Federated Learning based on Differentially Private Sketches

Abstract:Federated learning (FL) faces two primary challenges: the risk of privacy leakage due to parameter sharing and communication inefficiencies. To address these challenges, we propose DPSFL, a federated learning method that utilizes differentially private sketches. DPSFL compresses the local gradients of each client using a count sketch, thereby improving communication efficiency, while adding noise to the sketches to ensure differential privacy (DP). We provide a theoretical analysis of privacy and convergence for the proposed method. Gradient clipping is essential in DP learning to limit sensitivity and constrain the addition of noise. However, clipping introduces bias into the gradients, negatively impacting FL performance. To mitigate the impact of clipping, we propose an enhanced method, DPSFL-AC, which employs an adaptive clipping strategy. Experimental comparisons with existing techniques demonstrate the superiority of our methods concerning privacy preservation, communication efficiency, and model accuracy.

Via

Access Paper or Ask Questions

Partition-based differentially private synthetic data generation

Oct 10, 2023

Meifan Zhang, Dihang Deng, Lihua Yin

Abstract:Private synthetic data sharing is preferred as it keeps the distribution and nuances of original data compared to summary statistics. The state-of-the-art methods adopt a select-measure-generate paradigm, but measuring large domain marginals still results in much error and allocating privacy budget iteratively is still difficult. To address these issues, our method employs a partition-based approach that effectively reduces errors and improves the quality of synthetic data, even with a limited privacy budget. Results from our experiments demonstrate the superiority of our method over existing approaches. The synthetic data produced using our approach exhibits improved quality and utility, making it a preferable choice for private synthetic data sharing.

Via

Access Paper or Ask Questions