Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Massively Parallel Hyperparameter Tuning

Oct 17, 2018

Liam Li, Kevin Jamieson, Afshin Rostamizadeh, Ekaterina Gonina, Moritz Hardt, Benjamin Recht, Ameet Talwalkar

Figure 1 for Massively Parallel Hyperparameter Tuning

Figure 2 for Massively Parallel Hyperparameter Tuning

Figure 3 for Massively Parallel Hyperparameter Tuning

Figure 4 for Massively Parallel Hyperparameter Tuning

Share this with someone who'll enjoy it:

Abstract:Modern learning models are characterized by large hyperparameter spaces. In order to adequately explore these large spaces, we must evaluate a large number of configurations, typically orders of magnitude more configurations than available parallel workers. Given the growing costs of model training, we would ideally like to perform this search in roughly the same wall-clock time needed to train a single model. In this work, we tackle this challenge by introducing ASHA, a simple and robust hyperparameter tuning algorithm with solid theoretical underpinnings that exploits parallelism and aggressive early-stopping. Our extensive empirical results show that ASHA slightly outperforms Fabolas and Population Based Tuning, state-of-the hyperparameter tuning methods; scales linearly with the number of workers in distributed settings; converges to a high quality configuration in half the time taken by Vizier (Google's internal hyperparameter tuning service) in an experiment with 500 workers; and beats the published result for a near state-of-the-art LSTM architecture in under 2x the time to train a single model.

* Corrected typo in Algorithm 1

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Massively Parallel Hyperparameter Tuning

Paper and Code