Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nicholas D. Lane

FDAPT: Federated Domain-adaptive Pre-training for Language Models

Jul 12, 2023

Lekang Jiang, Filip Svoboda, Nicholas D. Lane

Figure 1 for FDAPT: Federated Domain-adaptive Pre-training for Language Models

Figure 2 for FDAPT: Federated Domain-adaptive Pre-training for Language Models

Abstract:Combining Domain-adaptive Pre-training (DAPT) with Federated Learning (FL) can enhance model adaptation by leveraging more sensitive and distributed data while preserving data privacy. However, few studies have focused on this method. Therefore, we conduct the first comprehensive empirical study to evaluate the performance of Federated Domain-adaptive Pre-training (FDAPT). We demonstrate that FDAPT can maintain competitive downstream task performance to the centralized baseline in both IID and non-IID situations. Furthermore, we propose a novel algorithm, Frozen Federated Domain-adaptive Pre-training (FFDAPT). FFDAPT improves the computational efficiency by 12.1% on average and exhibits similar downstream task performance to standard FDAPT, with general performance fluctuations remaining less than 1%. Finally, through a critical evaluation of our work, we identify promising future research directions for this new research area.

* 6 pages

Via

Access Paper or Ask Questions

FedVal: Different good or different bad in federated learning

Jun 06, 2023

Viktor Valadi, Xinchi Qiu, Pedro Porto Buarque de Gusmão, Nicholas D. Lane, Mina Alibeigi

Abstract:Federated learning (FL) systems are susceptible to attacks from malicious actors who might attempt to corrupt the training model through various poisoning attacks. FL also poses new challenges in addressing group bias, such as ensuring fair performance for different demographic groups. Traditional methods used to address such biases require centralized access to the data, which FL systems do not have. In this paper, we present a novel approach FedVal for both robustness and fairness that does not require any additional information from clients that could raise privacy concerns and consequently compromise the integrity of the FL system. To this end, we propose an innovative score function based on a server-side validation method that assesses client updates and determines the optimal aggregation balance between locally-trained models. Our research shows that this approach not only provides solid protection against poisoning attacks but can also be used to reduce group bias and subsequently promote fairness while maintaining the system's capability for differential privacy. Extensive experiments on the CIFAR-10, FEMNIST, and PUMS ACSIncome datasets in different configurations demonstrate the effectiveness of our method, resulting in state-of-the-art performances. We have proven robustness in situations where 80% of participating clients are malicious. Additionally, we have shown a significant increase in accuracy for underrepresented labels from 32% to 53%, and increase in recall rate for underrepresented features from 19% to 50%.

* To appear in the proceedings of the USENIX Security Symposium 2023

Via

Access Paper or Ask Questions

vFedSec: Efficient Secure Aggregation for Vertical Federated Learning via Secure Layer

May 26, 2023

Xinchi Qiu, Heng Pan, Wanru Zhao, Chenyang Ma, Pedro P. B. Gusmao, Nicholas D. Lane

Abstract:Most work in privacy-preserving federated learning (FL) has been focusing on horizontally partitioned datasets where clients share the same sets of features and can train complete models independently. However, in many interesting problems, individual data points are scattered across different clients/organizations in a vertical setting. Solutions for this type of FL require the exchange of intermediate outputs and gradients between participants, posing a potential risk of privacy leakage when privacy and security concerns are not considered. In this work, we present vFedSec - a novel design with an innovative Secure Layer for training vertical FL securely and efficiently using state-of-the-art security modules in secure aggregation. We theoretically demonstrate that our method does not impact the training performance while protecting private data effectively. Empirically results also show its applicability with extensive experiments that our design can achieve the protection with negligible computation and communication overhead. Also, our method can obtain 9.1e2 ~ 3.8e4 speedup compared to widely-adopted homomorphic encryption (HE) method.

* Generalised extension from our previous work: arXiv:2305.11236

Via

Access Paper or Ask Questions

Privacy in Multimodal Federated Human Activity Recognition

May 20, 2023

Alex Iacob, Pedro P. B. Gusmão, Nicholas D. Lane, Armand K. Koupai, Mohammud J. Bocus, Raúl Santos-Rodríguez, Robert J. Piechocki, Ryan McConville

Figure 1 for Privacy in Multimodal Federated Human Activity Recognition

Figure 2 for Privacy in Multimodal Federated Human Activity Recognition

Figure 3 for Privacy in Multimodal Federated Human Activity Recognition

Figure 4 for Privacy in Multimodal Federated Human Activity Recognition

Abstract:Human Activity Recognition (HAR) training data is often privacy-sensitive or held by non-cooperative entities. Federated Learning (FL) addresses such concerns by training ML models on edge clients. This work studies the impact of privacy in federated HAR at a user, environment, and sensor level. We show that the performance of FL for HAR depends on the assumed privacy level of the FL system and primarily upon the colocation of data from different sensors. By avoiding data sharing and assuming privacy at the human or environment level, as prior works have done, the accuracy decreases by 5-7%. However, extending this to the modality level and strictly separating sensor data between multiple clients may decrease the accuracy by 19-42%. As this form of privacy is necessary for the ethical utilisation of passive sensing methods in HAR, we implement a system where clients mutually train both a general FL model and a group-level one per modality. Our evaluation shows that this method leads to only a 7-13% decrease in accuracy, making it possible to build HAR systems with diverse hardware.

* In 3rd On-Device Intelligence Workshop at MLSys 2023, 8 pages

Via

Access Paper or Ask Questions

Efficient Vertical Federated Learning with Secure Aggregation

May 18, 2023

Xinchi Qiu, Heng Pan, Wanru Zhao, Chenyang Ma, Pedro Porto Buarque de Gusmão, Nicholas D. Lane

Figure 1 for Efficient Vertical Federated Learning with Secure Aggregation

Figure 2 for Efficient Vertical Federated Learning with Secure Aggregation

Figure 3 for Efficient Vertical Federated Learning with Secure Aggregation

Figure 4 for Efficient Vertical Federated Learning with Secure Aggregation

Abstract:The majority of work in privacy-preserving federated learning (FL) has been focusing on horizontally partitioned datasets where clients share the same sets of features and can train complete models independently. However, in many interesting problems, such as financial fraud detection and disease detection, individual data points are scattered across different clients/organizations in vertical federated learning. Solutions for this type of FL require the exchange of gradients between participants and rarely consider privacy and security concerns, posing a potential risk of privacy leakage. In this work, we present a novel design for training vertical FL securely and efficiently using state-of-the-art security modules for secure aggregation. We demonstrate empirically that our method does not impact training performance whilst obtaining 9.1e2 ~3.8e4 speedup compared to homomorphic encryption (HE).

* Federated Learning Systems (FLSys) Workshop @ MLSys 2023

Via

Access Paper or Ask Questions

Can Fair Federated Learning reduce the need for Personalisation?

May 04, 2023

Alex Iacob, Pedro P. B. Gusmão, Nicholas D. Lane

Abstract:Federated Learning (FL) enables training ML models on edge clients without sharing data. However, the federated model's performance on local data varies, disincentivising the participation of clients who benefit little from FL. Fair FL reduces accuracy disparity by focusing on clients with higher losses while personalisation locally fine-tunes the model. Personalisation provides a participation incentive when an FL model underperforms relative to one trained locally. For situations where the federated model provides a lower accuracy than a model trained entirely locally by a client, personalisation improves the accuracy of the pre-trained federated weights to be similar to or exceed those of the local client model. This paper evaluates two Fair FL (FFL) algorithms as starting points for personalisation. Our results show that FFL provides no benefit to relative performance in a language task and may double the number of underperforming clients for an image task. Instead, we propose Personalisation-aware Federated Learning (PaFL) as a paradigm that pre-emptively uses personalisation losses during training. Our technique shows a 50% reduction in the number of underperforming clients for the language task while lowering the number of underperforming clients in the image task instead of doubling it. Thus, evidence indicates that it may allow a broader set of devices to benefit from FL and represents a promising avenue for future experimentation and theoretical analysis.

* In 3rd Workshop on Machine Learning and Systems (EuroMLSys 2023), 9 pages

Via

Access Paper or Ask Questions

Gradient-less Federated Gradient Boosting Trees with Learnable Learning Rates

May 02, 2023

Chenyang Ma, Xinchi Qiu, Daniel J. Beutel, Nicholas D. Lane

Figure 1 for Gradient-less Federated Gradient Boosting Trees with Learnable Learning Rates

Figure 2 for Gradient-less Federated Gradient Boosting Trees with Learnable Learning Rates

Figure 3 for Gradient-less Federated Gradient Boosting Trees with Learnable Learning Rates

Figure 4 for Gradient-less Federated Gradient Boosting Trees with Learnable Learning Rates

Abstract:The privacy-sensitive nature of decentralized datasets and the robustness of eXtreme Gradient Boosting (XGBoost) on tabular data raise the needs to train XGBoost in the context of federated learning (FL). Existing works on federated XGBoost in the horizontal setting rely on the sharing of gradients, which induce per-node level communication frequency and serious privacy concerns. To alleviate these problems, we develop an innovative framework for horizontal federated XGBoost which does not depend on the sharing of gradients and simultaneously boosts privacy and communication efficiency by making the learning rates of the aggregated tree ensembles learnable. We conduct extensive evaluations on various classification and regression datasets, showing our approach achieves performance comparable to the state-of-the-art method and effectively improves communication efficiency by lowering both communication rounds and communication overhead by factors ranging from 25x to 700x.

* Accepted at the 3rd ACM Workshop on Machine Learning and Systems (EuroMLSys), May 8th 2023, Rome, Italy

Via

Access Paper or Ask Questions

A Federated Learning Benchmark for Drug-Target Interaction

Feb 15, 2023

Filip Svoboda, Gianluca Mittone, Nicholas D. Lane, Pietro Lio

Abstract:Aggregating pharmaceutical data in the drug-target interaction (DTI) domain has the potential to deliver life-saving breakthroughs. It is, however, notoriously difficult due to regulatory constraints and commercial interests. This work proposes the application of federated learning, which we argue to be reconcilable with the industry's constraints, as it does not require sharing of any information that would reveal the entities' data or any other high-level summary of it. When used on a representative GraphDTA model and the KIBA dataset it achieves up to 15% improved performance relative to the best available non-privacy preserving alternative. Our extensive battery of experiments shows that, unlike in other domains, the non-IID data distribution in the DTI datasets does not deteriorate FL performance. Additionally, we identify a material trade-off between the benefits of adding new data, and the cost of adding more clients.

Via

Access Paper or Ask Questions

Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation

Dec 15, 2022

Royson Lee, Rui Li, Stylianos I. Venieris, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane

Figure 1 for Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation

Figure 2 for Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation

Figure 3 for Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation

Figure 4 for Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation

Abstract:Recent image degradation estimation methods have enabled single-image super-resolution (SR) approaches to better upsample real-world images. Among these methods, explicit kernel estimation approaches have demonstrated unprecedented performance at handling unknown degradations. Nonetheless, a number of limitations constrain their efficacy when used by downstream SR models. Specifically, this family of methods yields i) excessive inference time due to long per-image adaptation times and ii) inferior image fidelity due to kernel mismatch. In this work, we introduce a learning-to-learn approach that meta-learns from the information contained in a distribution of images, thereby enabling significantly faster adaptation to new images with substantially improved performance in both kernel estimation and image fidelity. Specifically, we meta-train a kernel-generating GAN, named MetaKernelGAN, on a range of tasks, such that when a new image is presented, the generator starts from an informed kernel estimate and the discriminator starts with a strong capability to distinguish between patch distributions. Compared with state-of-the-art methods, our experiments show that MetaKernelGAN better estimates the magnitude and covariance of the kernel, leading to state-of-the-art blind SR results within a similar computational regime when combined with a non-blind SR model. Through supervised learning of an unsupervised learner, our method maintains the generalizability of the unsupervised learner, improves the optimization stability of kernel estimation, and hence image adaptation, and leads to a faster inference with a speedup between 14.24 to 102.1x over existing methods.

* Preprint: under review

Via

Access Paper or Ask Questions

NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution

Dec 15, 2022

Stylianos I. Venieris, Mario Almeida, Royson Lee, Nicholas D. Lane

Figure 1 for NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution

Figure 2 for NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution

Figure 3 for NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution

Figure 4 for NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution

Abstract:In recent years, image and video delivery systems have begun integrating deep learning super-resolution (SR) approaches, leveraging their unprecedented visual enhancement capabilities while reducing reliance on networking conditions. Nevertheless, deploying these solutions on mobile devices still remains an active challenge as SR models are excessively demanding with respect to workload and memory footprint. Despite recent progress on on-device SR frameworks, existing systems either penalize visual quality, lead to excessive energy consumption or make inefficient use of the available resources. This work presents NAWQ-SR, a novel framework for the efficient on-device execution of SR models. Through a novel hybrid-precision quantization technique and a runtime neural image codec, NAWQ-SR exploits the multi-precision capabilities of modern mobile NPUs in order to minimize latency, while meeting user-specified quality constraints. Moreover, NAWQ-SR selectively adapts the arithmetic precision at run time to equip the SR DNN's layers with wider representational power, improving visual quality beyond what was previously possible on NPUs. Altogether, NAWQ-SR achieves an average speedup of 7.9x, 3x and 1.91x over the state-of-the-art on-device SR systems that use heterogeneous processors (MobiSR), CPU (SplitSR) and NPU (XLSR), respectively. Furthermore, NAWQ-SR delivers an average of 3.2x speedup and 0.39 dB higher PSNR over status-quo INT8 NPU designs, but most importantly mitigates the negative effects of quantization on visual quality, setting a new state-of-the-art in the attainable quality of NPU-based SR.

* Under review

Via

Access Paper or Ask Questions