Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Parikshit Ram

FLoRA: Single-shot Hyper-parameter Optimization for Federated Learning

Dec 15, 2021

Yi Zhou, Parikshit Ram, Theodoros Salonidis, Nathalie Baracaldo, Horst Samulowitz, Heiko Ludwig

Figure 1 for FLoRA: Single-shot Hyper-parameter Optimization for Federated Learning

Figure 2 for FLoRA: Single-shot Hyper-parameter Optimization for Federated Learning

Figure 3 for FLoRA: Single-shot Hyper-parameter Optimization for Federated Learning

Figure 4 for FLoRA: Single-shot Hyper-parameter Optimization for Federated Learning

Abstract:We address the relatively unexplored problem of hyper-parameter optimization (HPO) for federated learning (FL-HPO). We introduce Federated Loss suRface Aggregation (FLoRA), the first FL-HPO solution framework that can address use cases of tabular data and gradient boosting training algorithms in addition to stochastic gradient descent/neural networks commonly addressed in the FL literature. The framework enables single-shot FL-HPO, by first identifying a good set of hyper-parameters that are used in a **single** FL training. Thus, it enables FL-HPO solutions with minimal additional communication overhead compared to FL training without HPO. Our empirical evaluation of FLoRA for Gradient Boosted Decision Trees on seven OpenML data sets demonstrates significant model accuracy improvements over the considered baseline, and robustness to increasing number of parties involved in FL-HPO training.

Via

Access Paper or Ask Questions

Federated Nearest Neighbor Classification with a Colony of Fruit-Flies: With Supplement

Dec 14, 2021

Parikshit Ram, Kaushik Sinha

Figure 1 for Federated Nearest Neighbor Classification with a Colony of Fruit-Flies: With Supplement

Figure 2 for Federated Nearest Neighbor Classification with a Colony of Fruit-Flies: With Supplement

Figure 3 for Federated Nearest Neighbor Classification with a Colony of Fruit-Flies: With Supplement

Figure 4 for Federated Nearest Neighbor Classification with a Colony of Fruit-Flies: With Supplement

Abstract:The mathematical formalization of a neurological mechanism in the olfactory circuit of a fruit-fly as a locality sensitive hash (Flyhash) and bloom filter (FBF) has been recently proposed and "reprogrammed" for various machine learning tasks such as similarity search, outlier detection and text embeddings. We propose a novel reprogramming of this hash and bloom filter to emulate the canonical nearest neighbor classifier (NNC) in the challenging Federated Learning (FL) setup where training and test data are spread across parties and no data can leave their respective parties. Specifically, we utilize Flyhash and FBF to create the FlyNN classifier, and theoretically establish conditions where FlyNN matches NNC. We show how FlyNN is trained exactly in a FL setup with low communication overhead to produce FlyNNFL, and how it can be differentially private. Empirically, we demonstrate that (i) FlyNN matches NNC accuracy across 70 OpenML datasets, (ii) FlyNNFL training is highly scalable with low communication overhead, providing up to $8\times$ speedup with $16$ parties.

* A extended version of the original paper with detailed supplementary materials (21 pages, 17 figures)

Via

Access Paper or Ask Questions

Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD

Sep 15, 2021

Chen Fan, Parikshit Ram, Sijia Liu

Figure 1 for Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD

Figure 2 for Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD

Figure 3 for Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD

Figure 4 for Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD

Abstract:We propose a new computationally-efficient first-order algorithm for Model-Agnostic Meta-Learning (MAML). The key enabling technique is to interpret MAML as a bilevel optimization (BLO) problem and leverage the sign-based SGD(signSGD) as a lower-level optimizer of BLO. We show that MAML, through the lens of signSGD-oriented BLO, naturally yields an alternating optimization scheme that just requires first-order gradients of a learned meta-model. We term the resulting MAML algorithm Sign-MAML. Compared to the conventional first-order MAML (FO-MAML) algorithm, Sign-MAML is theoretically-grounded as it does not impose any assumption on the absence of second-order derivatives during meta training. In practice, we show that Sign-MAML outperforms FO-MAML in various few-shot image classification tasks, and compared to MAML, it achieves a much more graceful tradeoff between classification accuracy and computation efficiency.

Via

Access Paper or Ask Questions

Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization

Jul 14, 2021

Paulito P. Palmes, Akihiro Kishimoto, Radu Marinescu, Parikshit Ram, Elizabeth Daly

Figure 1 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization

Figure 2 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization

Figure 3 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization

Figure 4 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization

Abstract:The pipeline optimization problem in machine learning requires simultaneous optimization of pipeline structures and parameter adaptation of their elements. Having an elegant way to express these structures can help lessen the complexity in the management and analysis of their performances together with the different choices of optimization strategies. With these issues in mind, we created the AutoMLPipeline (AMLP) toolkit which facilitates the creation and evaluation of complex machine learning pipeline structures using simple expressions. We use AMLP to find optimal pipeline signatures, datamine them, and use these datamined features to speed-up learning and prediction. We formulated a two-stage pipeline optimization with surrogate modeling in AMLP which outperforms other AutoML approaches with a 4-hour time budget in less than 5 minutes of AMLP computation time.

Via

Access Paper or Ask Questions

Learned Fine-Tuner for Incongruous Few-Shot Learning

Oct 20, 2020

Pu Zhao, Sijia Liu, Parikshit Ram, Songtao Lu, Djallel Bouneffouf, Xue Lin

Figure 1 for Learned Fine-Tuner for Incongruous Few-Shot Learning

Figure 2 for Learned Fine-Tuner for Incongruous Few-Shot Learning

Figure 3 for Learned Fine-Tuner for Incongruous Few-Shot Learning

Figure 4 for Learned Fine-Tuner for Incongruous Few-Shot Learning

Abstract:Model-agnostic meta-learning (MAML) effectively meta-learns an initialization of model parameters for few-shot learning where all learning problems share the same format of model parameters -- congruous meta-learning. We extend MAML to incongruous meta-learning where different yet related few-shot learning problems may not share any model parameters. A Learned Fine Tuner (LFT) is used to replace hand-designed optimizers such as SGD for the task-specific fine-tuning. Here, MAML instead meta-learns the parameters of this LFT across incongruous tasks leveraging the learning-to-optimize (L2O) framework such that models fine-tuned with LFT (even from random initializations) adapt quickly to new tasks. As novel contributions, we show that the use of LFT within MAML (i) offers the capability to tackle few-shot learning tasks by meta-learning across incongruous yet related problems (e.g., classification over images of different sizes and model architectures), and (ii) can efficiently work with first-order and derivative-free few-shot learning problems. Theoretically, we quantify the difference between LFT (for MAML) and L2O. Empirically, we demonstrate the effectiveness of LFT through both synthetic and real problems and a novel application of generating universal adversarial attacks across different image sources in the few-shot learning regime.

Via

Access Paper or Ask Questions

Neural Neighborhood Encoding for Classification

Aug 19, 2020

Kaushik Sinha, Parikshit Ram

Figure 1 for Neural Neighborhood Encoding for Classification

Figure 2 for Neural Neighborhood Encoding for Classification

Figure 3 for Neural Neighborhood Encoding for Classification

Figure 4 for Neural Neighborhood Encoding for Classification

Abstract:Inspired by the fruit-fly olfactory circuit, the Fly Bloom Filter [Dasgupta et al., 2018] is able to efficiently summarize the data with a single pass and has been used for novelty detection. We propose a new classifier (for binary and multi-class classification) that effectively encodes the different local neighborhoods for each class with a per-class Fly Bloom Filter. The inference on test data requires an efficient {\tt FlyHash} [Dasgupta, et al., 2017] operation followed by a high-dimensional, but {\em sparse}, dot product with the per-class Bloom Filters. The learning is trivially parallelizable. On the theoretical side, we establish conditions under which the prediction of our proposed classifier on any test example agrees with the prediction of the nearest neighbor classifier with high probability. We extensively evaluate our proposed scheme with over $50$ data sets of varied data dimensionality to demonstrate that the predictive performance of our proposed neuroscience inspired classifier is competitive the the nearest-neighbor classifiers and other single-pass classifiers.

Via

Access Paper or Ask Questions

Solving Constrained CASH Problems with ADMM

Jul 11, 2020

Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray

Figure 1 for Solving Constrained CASH Problems with ADMM

Figure 2 for Solving Constrained CASH Problems with ADMM

Figure 3 for Solving Constrained CASH Problems with ADMM

Figure 4 for Solving Constrained CASH Problems with ADMM

Abstract:The CASH problem has been widely studied in the context of automated configurations of machine learning (ML) pipelines and various solvers and toolkits are available. However, CASH solvers do not directly handle black-box constraints such as fairness, robustness or other domain-specific custom constraints. We present our recent approach [Liu, et al., 2020] that leverages the ADMM optimization framework to decompose CASH into multiple small problems and demonstrate how ADMM facilitates incorporation of black-box constraints.

* 7th ICML Workshop on Automated Machine Learning (2020)

Via

Access Paper or Ask Questions

Lale: Consistent Automated Machine Learning

Jul 04, 2020

Guillaume Baudart, Martin Hirzel, Kiran Kate, Parikshit Ram, Avraham Shinnar

Figure 1 for Lale: Consistent Automated Machine Learning

Figure 2 for Lale: Consistent Automated Machine Learning

Figure 3 for Lale: Consistent Automated Machine Learning

Figure 4 for Lale: Consistent Automated Machine Learning

Abstract:Automated machine learning makes it easier for data scientists to develop pipelines by searching over possible choices for hyperparameters, algorithms, and even pipeline topologies. Unfortunately, the syntax for automated machine learning tools is inconsistent with manual machine learning, with each other, and with error checks. Furthermore, few tools support advanced features such as topology search or higher-order operators. This paper introduces Lale, a library of high-level Python interfaces that simplifies and unifies automated machine learning in a consistent way.

* KDD Workshop on Automation in Machine Learning (AutoML@KDD), August 2020

Via

Access Paper or Ask Questions

How can AI Automate End-to-End Data Science?

Oct 22, 2019

Charu Aggarwal, Djallel Bouneffouf, Horst Samulowitz, Beat Buesser, Thanh Hoang, Udayan Khurana, Sijia Liu, Tejaswini Pedapati, Parikshit Ram, Ambrish Rawat(+2 more)

Figure 1 for How can AI Automate End-to-End Data Science?

Abstract:Data science is labor-intensive and human experts are scarce but heavily involved in every aspect of it. This makes data science time consuming and restricted to experts with the resulting quality heavily dependent on their experience and skills. To make data science more accessible and scalable, we need its democratization. Automated Data Science (AutoDS) is aimed towards that goal and is emerging as an important research and business topic. We introduce and define the AutoDS challenge, followed by a proposal of a general AutoDS framework that covers existing approaches but also provides guidance for the development of new methods. We categorize and review the existing literature from multiple aspects of the problem setup and employed techniques. Then we provide several views on how AI could succeed in automating end-to-end AutoDS. We hope this survey can serve as insightful guideline for the AutoDS field and provide inspiration for future research.

Via

Access Paper or Ask Questions

Human-AI Collaboration in Data Science: Exploring Data Scientists' Perceptions of Automated AI

Sep 05, 2019

Dakuo Wang, Justin D. Weisz, Michael Muller, Parikshit Ram, Werner Geyer, Casey Dugan, Yla Tausczik, Horst Samulowitz, Alexander Gray

Figure 1 for Human-AI Collaboration in Data Science: Exploring Data Scientists' Perceptions of Automated AI

Figure 2 for Human-AI Collaboration in Data Science: Exploring Data Scientists' Perceptions of Automated AI

Figure 3 for Human-AI Collaboration in Data Science: Exploring Data Scientists' Perceptions of Automated AI

Abstract:The rapid advancement of artificial intelligence (AI) is changing our lives in many ways. One application domain is data science. New techniques in automating the creation of AI, known as AutoAI or AutoML, aim to automate the work practices of data scientists. AutoAI systems are capable of autonomously ingesting and pre-processing data, engineering new features, and creating and scoring models based on a target objectives (e.g. accuracy or run-time efficiency). Though not yet widely adopted, we are interested in understanding how AutoAI will impact the practice of data science. We conducted interviews with 20 data scientists who work at a large, multinational technology company and practice data science in various business settings. Our goal is to understand their current work practices and how these practices might change with AutoAI. Reactions were mixed: while informants expressed concerns about the trend of automating their jobs, they also strongly felt it was inevitable. Despite these concerns, they remained optimistic about their future job security due to a view that the future of data science work will be a collaboration between humans and AI systems, in which both automation and human expertise are indispensable.

Via

Access Paper or Ask Questions