Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vahid Tarokh

Duke University

DynamicFL: Federated Learning with Dynamic Communication Resource Allocation

Sep 08, 2024

Qi Le, Enmao Diao, Xinran Wang, Vahid Tarokh, Jie Ding, Ali Anwar

Figure 1 for DynamicFL: Federated Learning with Dynamic Communication Resource Allocation

Figure 2 for DynamicFL: Federated Learning with Dynamic Communication Resource Allocation

Figure 3 for DynamicFL: Federated Learning with Dynamic Communication Resource Allocation

Figure 4 for DynamicFL: Federated Learning with Dynamic Communication Resource Allocation

Abstract:Federated Learning (FL) is a collaborative machine learning framework that allows multiple users to train models utilizing their local data in a distributed manner. However, considerable statistical heterogeneity in local data across devices often leads to suboptimal model performance compared with independently and identically distributed (IID) data scenarios. In this paper, we introduce DynamicFL, a new FL framework that investigates the trade-offs between global model performance and communication costs for two widely adopted FL methods: Federated Stochastic Gradient Descent (FedSGD) and Federated Averaging (FedAvg). Our approach allocates diverse communication resources to clients based on their data statistical heterogeneity, considering communication resource constraints, and attains substantial performance enhancements compared to uniform communication resource allocation. Notably, our method bridges the gap between FedSGD and FedAvg, providing a flexible framework leveraging communication heterogeneity to address statistical heterogeneity in FL. Through extensive experiments, we demonstrate that DynamicFL surpasses current state-of-the-art methods with up to a 10% increase in model accuracy, demonstrating its adaptability and effectiveness in tackling data statistical heterogeneity challenges.

Via

Access Paper or Ask Questions

Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

Jul 31, 2024

Patrick Kuiper, Ali Hasan, Wenhao Yang, Yuting Ng, Hoda Bidkhori, Jose Blanchet, Vahid Tarokh

Figure 1 for Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

Figure 2 for Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

Figure 3 for Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

Figure 4 for Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

Abstract:The goal of this paper is to develop distributionally robust optimization (DRO) estimators, specifically for multidimensional Extreme Value Theory (EVT) statistics. EVT supports using semi-parametric models called max-stable distributions built from spatial Poisson point processes. While powerful, these models are only asymptotically valid for large samples. However, since extreme data is by definition scarce, the potential for model misspecification error is inherent to these applications, thus DRO estimators are natural. In order to mitigate over-conservative estimates while enhancing out-of-sample performance, we study DRO estimators informed by semi-parametric max-stable constraints in the space of point processes. We study both tractable convex formulations for some problems of interest (e.g. CVaR) and more general neural network based estimators. Both approaches are validated using synthetically generated data, recovering prescribed characteristics, and verifying the efficacy of the proposed techniques. Additionally, the proposed method is applied to a real data set of financial returns for comparison to a previous analysis. We established the proposed model as a novel formulation in the multivariate EVT domain, and innovative with respect to performance when compared to relevant alternate proposals.

Via

Access Paper or Ask Questions

Generative Learning for Simulation of Vehicle Faults

Jul 30, 2024

Patrick Kuiper, Sirui Lin, Jose Blanchet, Vahid Tarokh

Abstract:We develop a novel generative model to simulate vehicle health and forecast faults, conditioned on practical operational considerations. The model, trained on data from the US Army's Predictive Logistics program, aims to support predictive maintenance. It forecasts faults far enough in advance to execute a maintenance intervention before a breakdown occurs. The model incorporates real-world factors that affect vehicle health. It also allows us to understand the vehicle's condition by analyzing operating data, and characterizing each vehicle into discrete states. Importantly, the model predicts the time to first fault with high accuracy. We compare its performance to other models and demonstrate its successful training.

Via

Access Paper or Ask Questions

Generative Learning for Simulation of US Army Vehicle Faults

Jul 24, 2024

Patrick Kuiper, Sirui Lin, Jose Blanchet, Vahid Tarokh

Via

Access Paper or Ask Questions

Base Models for Parabolic Partial Differential Equations

Jul 17, 2024

Xingzi Xu, Ali Hasan, Jie Ding, Vahid Tarokh

Abstract:Parabolic partial differential equations (PDEs) appear in many disciplines to model the evolution of various mathematical objects, such as probability flows, value functions in control theory, and derivative prices in finance. It is often necessary to compute the solutions or a function of the solutions to a parametric PDE in multiple scenarios corresponding to different parameters of this PDE. This process often requires resolving the PDEs from scratch, which is time-consuming. To better employ existing simulations for the PDEs, we propose a framework for finding solutions to parabolic PDEs across different scenarios by meta-learning an underlying base distribution. We build upon this base distribution to propose a method for computing solutions to parametric PDEs under different parameter settings. Finally, we illustrate the application of the proposed methods through extensive experiments in generative modeling, stochastic control, and finance. The empirical results suggest that the proposed approach improves generalization to solving PDEs under new parameter regimes.

* Appears in UAI 2024

Via

Access Paper or Ask Questions

Robust Score-Based Quickest Change Detection

Jul 15, 2024

Sean Moushegian, Suya Wu, Enmao Diao, Jie Ding, Taposh Banerjee, Vahid Tarokh

Figure 1 for Robust Score-Based Quickest Change Detection

Figure 2 for Robust Score-Based Quickest Change Detection

Figure 3 for Robust Score-Based Quickest Change Detection

Figure 4 for Robust Score-Based Quickest Change Detection

Abstract:Methods in the field of quickest change detection rapidly detect in real-time a change in the data-generating distribution of an online data stream. Existing methods have been able to detect this change point when the densities of the pre- and post-change distributions are known. Recent work has extended these results to the case where the pre- and post-change distributions are known only by their score functions. This work considers the case where the pre- and post-change score functions are known only to correspond to distributions in two disjoint sets. This work employs a pair of "least-favorable" distributions to robustify the existing score-based quickest change detection algorithm, the properties of which are studied. This paper calculates the least-favorable distributions for specific model classes and provides methods of estimating the least-favorable distributions for common constructions. Simulation results are provided demonstrating the performance of our robust change detection algorithm.

* arXiv admin note: text overlap with arXiv:2306.05091

Via

Access Paper or Ask Questions

RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications

Jun 14, 2024

Shyam Venkatasubramanian, Bosung Kang, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

Figure 1 for RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications

Figure 2 for RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications

Figure 3 for RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications

Figure 4 for RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications

Abstract:This work presents a large-scale dataset for radar adaptive signal processing (RASP) applications, aimed at supporting the development of data-driven models within the radar community. The dataset, called RASPNet, consists of 100 realistic scenarios compiled over a variety of topographies and land types from across the contiguous United States, designed to reflect a diverse array of real-world environments. Within each scenario, RASPNet consists of 10,000 clutter realizations from an airborne radar setting, which can be utilized for radar algorithm development and evaluation. RASPNet intends to fill a prominent gap in the availability of a large-scale, realistic dataset that standardizes the evaluation of adaptive radar processing techniques. We describe its construction, organization, and several potential applications, which includes a transfer learning example to demonstrate how RASPNet can be leveraged for realistic adaptive radar processing scenarios.

Via

Access Paper or Ask Questions

ColA: Collaborative Adaptation with Gradient Learning

Apr 22, 2024

Enmao Diao, Qi Le, Suya Wu, Xinran Wang, Ali Anwar, Jie Ding, Vahid Tarokh

Figure 1 for ColA: Collaborative Adaptation with Gradient Learning

Figure 2 for ColA: Collaborative Adaptation with Gradient Learning

Figure 3 for ColA: Collaborative Adaptation with Gradient Learning

Figure 4 for ColA: Collaborative Adaptation with Gradient Learning

Abstract:A primary function of back-propagation is to compute both the gradient of hidden representations and parameters for optimization with gradient descent. Training large models requires high computational costs due to their vast parameter sizes. While Parameter-Efficient Fine-Tuning (PEFT) methods aim to train smaller auxiliary models to save computational space, they still present computational overheads, especially in Fine-Tuning as a Service (FTaaS) for numerous users. We introduce Collaborative Adaptation (ColA) with Gradient Learning (GL), a parameter-free, model-agnostic fine-tuning approach that decouples the computation of the gradient of hidden representations and parameters. In comparison to PEFT methods, ColA facilitates more cost-effective FTaaS by offloading the computation of the gradient to low-cost devices. We also provide a theoretical analysis of ColA and experimentally demonstrate that ColA can perform on par or better than existing PEFT methods on various benchmarks.

Via

Access Paper or Ask Questions

Neural McKean-Vlasov Processes: Distributional Dependence in Diffusion Processes

Apr 15, 2024

Haoming Yang, Ali Hasan, Yuting Ng, Vahid Tarokh

Abstract:McKean-Vlasov stochastic differential equations (MV-SDEs) provide a mathematical description of the behavior of an infinite number of interacting particles by imposing a dependence on the particle density. As such, we study the influence of explicitly including distributional information in the parameterization of the SDE. We propose a series of semi-parametric methods for representing MV-SDEs, and corresponding estimators for inferring parameters from data based on the properties of the MV-SDE. We analyze the characteristics of the different architectures and estimators, and consider their applicability in relevant machine learning problems. We empirically compare the performance of the different architectures and estimators on real and synthetic datasets for time series and probabilistic modeling. The results suggest that explicitly including distributional dependence in the parameterization of the SDE is effective in modeling temporal data with interaction under an exchangeability assumption while maintaining strong performance for standard It\^o-SDEs due to the richer class of probability flows associated with MV-SDEs.

* Appears in AISTATS 2024

Via

Access Paper or Ask Questions

Large Deviation Analysis of Score-based Hypothesis Testing

Feb 03, 2024

Enmao Diao, Taposh Banerjee, Vahid Tarokh

Abstract:Score-based statistical models play an important role in modern machine learning, statistics, and signal processing. For hypothesis testing, a score-based hypothesis test is proposed in \cite{wu2022score}. We analyze the performance of this score-based hypothesis testing procedure and derive upper bounds on the probabilities of its Type I and II errors. We prove that the exponents of our error bounds are asymptotically (in the number of samples) tight for the case of simple null and alternative hypotheses. We calculate these error exponents explicitly in specific cases and provide numerical studies for various other scenarios of interest.

Via

Access Paper or Ask Questions