Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Carlee Joe-Wong

HumekaFL: Automated Detection of Neonatal Asphyxia Using Federated Learning

Dec 02, 2024

Pamely Zantou, Blessed Guda, Bereket Retta, Gladys Inabeza, Carlee Joe-Wong, Assane Gueye

Figure 1 for HumekaFL: Automated Detection of Neonatal Asphyxia Using Federated Learning

Figure 2 for HumekaFL: Automated Detection of Neonatal Asphyxia Using Federated Learning

Figure 3 for HumekaFL: Automated Detection of Neonatal Asphyxia Using Federated Learning

Figure 4 for HumekaFL: Automated Detection of Neonatal Asphyxia Using Federated Learning

Abstract:Birth Apshyxia (BA) is a severe condition characterized by insufficient supply of oxygen to a newborn during the delivery. BA is one of the primary causes of neonatal death in the world. Although there has been a decline in neonatal deaths over the past two decades, the developing world, particularly sub-Saharan Africa, continues to experience the highest under-five (<5) mortality rates. While evidence-based methods are commonly used to detect BA in African healthcare settings, they can be subject to physician errors or delays in diagnosis, preventing timely interventions. Centralized Machine Learning (ML) methods demonstrated good performance in early detection of BA but require sensitive health data to leave their premises before training, which does not guarantee privacy and security. Healthcare institutions are therefore reluctant to adopt such solutions in Africa. To address this challenge, we suggest a federated learning (FL)-based software architecture, a distributed learning method that prioritizes privacy and security by design. We have developed a user-friendly and cost-effective mobile application embedding the FL pipeline for early detection of BA. Our Federated SVM model outperformed centralized SVM pipelines and Neural Networks (NN)-based methods in the existing literature

* Poster at ACM compass 2024

Via

Access Paper or Ask Questions

FedSPD: A Soft-clustering Approach for Personalized Decentralized Federated Learning

Oct 24, 2024

I-Cheng Lin, Osman Yagan, Carlee Joe-Wong

Figure 1 for FedSPD: A Soft-clustering Approach for Personalized Decentralized Federated Learning

Figure 2 for FedSPD: A Soft-clustering Approach for Personalized Decentralized Federated Learning

Figure 3 for FedSPD: A Soft-clustering Approach for Personalized Decentralized Federated Learning

Figure 4 for FedSPD: A Soft-clustering Approach for Personalized Decentralized Federated Learning

Abstract:Federated learning has recently gained popularity as a framework for distributed clients to collaboratively train a machine learning model using local data. While traditional federated learning relies on a central server for model aggregation, recent advancements adopt a decentralized framework, enabling direct model exchange between clients and eliminating the single point of failure. However, existing decentralized frameworks often assume all clients train a shared model. Personalizing each client's model can enhance performance, especially with heterogeneous client data distributions. We propose FedSPD, an efficient personalized federated learning algorithm for the decentralized setting, and show that it learns accurate models even in low-connectivity networks. To provide theoretical guarantees on convergence, we introduce a clustering-based framework that enables consensus on models for distinct data clusters while personalizing to unique mixtures of these clusters at different clients. This flexibility, allowing selective model updates based on data distribution, substantially reduces communication costs compared to prior work on personalized federated learning in decentralized settings. Experimental results on real-world datasets show that FedSPD outperforms multiple decentralized variants of personalized federated learning algorithms, especially in scenarios with low-connectivity networks.

Via

Access Paper or Ask Questions

FedBaF: Federated Learning Aggregation Biased by a Foundation Model

Oct 24, 2024

Jong-Ik Park, Srinivasa Pranav, José M. F. Moura, Carlee Joe-Wong

Abstract:Foundation models are now a major focus of leading technology organizations due to their ability to generalize across diverse tasks. Existing approaches for adapting foundation models to new applications often rely on Federated Learning (FL) and disclose the foundation model weights to clients when using it to initialize the global model. While these methods ensure client data privacy, they compromise model and information security. In this paper, we introduce Federated Learning Aggregation Biased by a Foundation Model (FedBaF), a novel method for dynamically integrating pre-trained foundation model weights during the FL aggregation phase. Unlike conventional methods, FedBaF preserves the confidentiality of the foundation model while still leveraging its power to train more accurate models, especially in non-IID and adversarial scenarios. Our comprehensive experiments use Pre-ResNet and foundation models like Vision Transformer to demonstrate that FedBaF not only matches, but often surpasses the test accuracy of traditional weight initialization methods by up to 11.4\% in IID and up to 15.8\% in non-IID settings. Additionally, FedBaF applied to a Transformer-based language model significantly reduced perplexity by up to 39.2\%.

Via

Access Paper or Ask Questions

Federated Communication-Efficient Multi-Objective Optimization

Oct 21, 2024

Baris Askin, Pranay Sharma, Gauri Joshi, Carlee Joe-Wong

Figure 1 for Federated Communication-Efficient Multi-Objective Optimization

Figure 2 for Federated Communication-Efficient Multi-Objective Optimization

Figure 3 for Federated Communication-Efficient Multi-Objective Optimization

Figure 4 for Federated Communication-Efficient Multi-Objective Optimization

Abstract:We study a federated version of multi-objective optimization (MOO), where a single model is trained to optimize multiple objective functions. MOO has been extensively studied in the centralized setting but is less explored in federated or distributed settings. We propose FedCMOO, a novel communication-efficient federated multi-objective optimization (FMOO) algorithm that improves the error convergence performance of the model compared to existing approaches. Unlike prior works, the communication cost of FedCMOO does not scale with the number of objectives, as each client sends a single aggregated gradient, obtained using randomized SVD (singular value decomposition), to the central server. We provide a convergence analysis of the proposed method for smooth non-convex objective functions under milder assumptions than in prior work. In addition, we introduce a variant of FedCMOO that allows users to specify a preference over the objectives in terms of a desired ratio of the final objective values. Through extensive experiments, we demonstrate the superiority of our proposed method over baseline approaches.

Via

Access Paper or Ask Questions

RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space

Oct 21, 2024

Jingdi Chen, Hanhan Zhou, Yongsheng Mei, Carlee Joe-Wong, Gina Adam, Nathaniel D. Bastian, Tian Lan

Figure 1 for RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space

Figure 2 for RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space

Figure 3 for RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space

Figure 4 for RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space

Abstract:Deep Reinforcement Learning (DRL) algorithms have achieved great success in solving many challenging tasks while their black-box nature hinders interpretability and real-world applicability, making it difficult for human experts to interpret and understand DRL policies. Existing works on interpretable reinforcement learning have shown promise in extracting decision tree (DT) based policies from DRL policies with most focus on the single-agent settings while prior attempts to introduce DT policies in multi-agent scenarios mainly focus on heuristic designs which do not provide any quantitative guarantees on the expected return. In this paper, we establish an upper bound on the return gap between the oracle expert policy and an optimal decision tree policy. This enables us to recast the DT extraction problem into a novel non-euclidean clustering problem over the local observation and action values space of each agent, with action values as cluster labels and the upper bound on the return gap as clustering loss. Both the algorithm and the upper bound are extended to multi-agent decentralized DT extractions by an iteratively-grow-DT procedure guided by an action-value function conditioned on the current DTs of other agents. Further, we propose the Return-Gap-Minimization Decision Tree (RGMDT) algorithm, which is a surprisingly simple design and is integrated with reinforcement learning through the utilization of a novel Regularized Information Maximization loss. Evaluations on tasks like D4RL show that RGMDT significantly outperforms heuristic DT-based baselines and can achieve nearly optimal returns under given DT complexity constraints (e.g., maximum number of DT nodes).

Via

Access Paper or Ask Questions

Neural Combinatorial Clustered Bandits for Recommendation Systems

Oct 18, 2024

Baran Atalar, Carlee Joe-Wong

Figure 1 for Neural Combinatorial Clustered Bandits for Recommendation Systems

Figure 2 for Neural Combinatorial Clustered Bandits for Recommendation Systems

Figure 3 for Neural Combinatorial Clustered Bandits for Recommendation Systems

Figure 4 for Neural Combinatorial Clustered Bandits for Recommendation Systems

Abstract:We consider the contextual combinatorial bandit setting where in each round, the learning agent, e.g., a recommender system, selects a subset of "arms," e.g., products, and observes rewards for both the individual base arms, which are a function of known features (called "context"), and the super arm (the subset of arms), which is a function of the base arm rewards. The agent's goal is to simultaneously learn the unknown reward functions and choose the highest-reward arms. For example, the "reward" may represent a user's probability of clicking on one of the recommended products. Conventional bandit models, however, employ restrictive reward function models in order to obtain performance guarantees. We make use of deep neural networks to estimate and learn the unknown reward functions and propose Neural UCB Clustering (NeUClust), which adopts a clustering approach to select the super arm in every round by exploiting underlying structure in the context space. Unlike prior neural bandit works, NeUClust uses a neural network to estimate the super arm reward and select the super arm, thus eliminating the need for a known optimization oracle. We non-trivially extend prior neural combinatorial bandit works to prove that NeUClust achieves $\widetilde{O}\left(\widetilde{d}\sqrt{T}\right)$ regret, where $\widetilde{d}$ is the effective dimension of a neural tangent kernel matrix, $T$ the number of rounds. Experiments on real world recommendation datasets show that NeUClust achieves better regret and reward than other contextual combinatorial and neural bandit algorithms.

Via

Access Paper or Ask Questions

FedGraph: A Research Library and Benchmark for Federated Graph Learning

Oct 08, 2024

Yuhang Yao, Yuan Li, Xinyi Fan, Junhao Li, Kay Liu, Weizhao Jin, Srivatsan Ravi, Philip S. Yu, Carlee Joe-Wong

Figure 1 for FedGraph: A Research Library and Benchmark for Federated Graph Learning

Figure 2 for FedGraph: A Research Library and Benchmark for Federated Graph Learning

Figure 3 for FedGraph: A Research Library and Benchmark for Federated Graph Learning

Figure 4 for FedGraph: A Research Library and Benchmark for Federated Graph Learning

Abstract:Federated graph learning is an emerging field with significant practical challenges. While many algorithms have been proposed to enhance model accuracy, their system performance, crucial for real-world deployment, is often overlooked. To address this gap, we present FedGraph, a research library designed for practical distributed deployment and benchmarking in federated graph learning. FedGraph supports a range of state-of-the-art methods and includes profiling tools for system performance evaluation, focusing on communication and computation costs during training. FedGraph can then facilitate the development of practical applications and guide the design of future algorithms.

* https://github.com/FedGraph/fedgraph

Via

Access Paper or Ask Questions

Efficient Federated Learning against Heterogeneous and Non-stationary Client Unavailability

Sep 26, 2024

Ming Xiang, Stratis Ioannidis, Edmund Yeh, Carlee Joe-Wong, Lili Su

Figure 1 for Efficient Federated Learning against Heterogeneous and Non-stationary Client Unavailability

Figure 2 for Efficient Federated Learning against Heterogeneous and Non-stationary Client Unavailability

Figure 3 for Efficient Federated Learning against Heterogeneous and Non-stationary Client Unavailability

Figure 4 for Efficient Federated Learning against Heterogeneous and Non-stationary Client Unavailability

Abstract:Addressing intermittent client availability is critical for the real-world deployment of federated learning algorithms. Most prior work either overlooks the potential non-stationarity in the dynamics of client unavailability or requires substantial memory/computation overhead. We study federated learning in the presence of heterogeneous and non-stationary client availability, which may occur when the deployment environments are uncertain or the clients are mobile. The impacts of the heterogeneity and non-stationarity in client unavailability can be significant, as we illustrate using FedAvg, the most widely adopted federated learning algorithm. We propose FedAPM, which includes novel algorithmic structures that (i) compensate for missed computations due to unavailability with only $O(1)$ additional memory and computation with respect to standard FedAvg, and (ii) evenly diffuse local updates within the federated learning system through implicit gossiping, despite being agnostic to non-stationary dynamics. We show that FedAPM converges to a stationary point of even non-convex objectives while achieving the desired linear speedup property. We corroborate our analysis with numerical experiments over diversified client unavailability dynamics on real-world data sets.

* Accepted to NeurIPS 2024

Via

Access Paper or Ask Questions

QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling

Sep 21, 2024

Blessed Guda, Gabrial Zencha A., Lawrence Francis, Carlee Joe-Wong

Figure 1 for QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling

Figure 2 for QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling

Figure 3 for QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling

Figure 4 for QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling

Abstract:Large Language models (LLMs) have brought about substantial advancements in the field of Question Answering (QA) systems. These models do remarkably well in addressing intricate inquiries in a variety of disciplines. However, because of domain-specific vocabulary, complex technological concepts, and the requirement for exact responses applying LLMs to specialized sectors like telecommunications presents additional obstacles. GPT-3.5 has been used in recent work, to obtain noteworthy accuracy for telecom-related questions in a Retrieval Augmented Generation (RAG) framework. Notwithstanding these developments, the practical use of models such as GPT-3.5 is restricted by their proprietary nature and high computing demands. This paper introduces QMOS, an innovative approach which uses a Question-Masked loss and Option Shuffling trick to enhance the performance of LLMs in answering Multiple-Choice Questions in the telecommunications domain. Our focus was on using opensource, smaller language models (Phi-2 and Falcon-7B) within an enhanced RAG framework. Our multi-faceted approach involves several enhancements to the whole LLM-RAG pipeline of finetuning, retrieval, prompt engineering and inference. Our approaches significantly outperform existing results, achieving accuracy improvements from baselines of 24.70% to 49.30% with Falcon-7B and from 42.07% to 84.65% with Phi-2.

Via

Access Paper or Ask Questions

Federated Learning with Flexible Architectures

Jun 14, 2024

Jong-Ik Park, Carlee Joe-Wong

Figure 1 for Federated Learning with Flexible Architectures

Figure 2 for Federated Learning with Flexible Architectures

Figure 3 for Federated Learning with Flexible Architectures

Figure 4 for Federated Learning with Flexible Architectures

Abstract:Traditional federated learning (FL) methods have limited support for clients with varying computational and communication abilities, leading to inefficiencies and potential inaccuracies in model training. This limitation hinders the widespread adoption of FL in diverse and resource-constrained environments, such as those with client devices ranging from powerful servers to mobile devices. To address this need, this paper introduces Federated Learning with Flexible Architectures (FedFA), an FL training algorithm that allows clients to train models of different widths and depths. Each client can select a network architecture suitable for its resources, with shallower and thinner networks requiring fewer computing resources for training. Unlike prior work in this area, FedFA incorporates the layer grafting technique to align clients' local architectures with the largest network architecture in the FL system during model aggregation. Layer grafting ensures that all client contributions are uniformly integrated into the global model, thereby minimizing the risk of any individual client's data skewing the model's parameters disproportionately and introducing security benefits. Moreover, FedFA introduces the scalable aggregation method to manage scale variations in weights among different network architectures. Experimentally, FedFA outperforms previous width and depth flexible aggregation strategies. Furthermore, FedFA demonstrates increased robustness against performance degradation in backdoor attack scenarios compared to earlier strategies.

Via

Access Paper or Ask Questions