Abstract: We present an optimizer which uses Bayesian optimization to tune the system parameters of distributed stochastic gradient descent (SGD). Given a specific context, our goal is to quickly find efficient configurations which appropriately balance the load between the available machines to minimize the average SGD iteration time. Our experiments consider setups with over thirty parameters. Traditional Bayesian optimization, which uses a Gaussian process as its model, is not well suited to such high-dimensional domains. To reduce convergence time, we exploit the available structure: we design a probabilistic model which simulates the behavior of distributed SGD and use it within Bayesian optimization. Our model can exploit many runtime measurements for inference per evaluation of the objective function. Our experiments show that the resulting optimizer converges to efficient configurations within ten iterations, and that the optimized configurations outperform those found by a generic optimizer after thirty iterations by up to 2X.
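To make the idea of a structured surrogate concrete, the sketch below is a hypothetical, simplified illustration (not the paper's actual model or code): the configuration assigns a share of each batch to every worker, the "structured model" predicts iteration time as the slowest worker's compute time plus a communication term, and each evaluation yields per-worker timings that are fed back into the model. The worker speeds, batch size, and the greedy candidate search standing in for a proper acquisition function are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical sketch: a structured surrogate for distributed SGD iteration time.
rng = np.random.default_rng(0)
N_WORKERS = 4
BATCH = 1024

def true_iteration_time(shares):
    # Stand-in for a real cluster run: heterogeneous worker speeds plus noise.
    speeds = np.array([1.0, 0.7, 1.3, 0.5])           # examples/ms, unknown to the optimizer
    compute = (shares * BATCH) / speeds                # per-worker compute time
    comm = 5.0                                         # fixed all-reduce cost (ms)
    return compute.max() + comm + rng.normal(0, 0.5), compute

def predict(shares, est_speeds, est_comm):
    # Structured model: simulate the iteration from estimated per-worker speeds.
    return ((shares * BATCH) / est_speeds).max() + est_comm

def propose(est_speeds, est_comm, n_candidates=200):
    # Greedy stand-in for an acquisition function: pick the candidate the
    # structured model predicts to be fastest.
    cands = rng.dirichlet(np.ones(N_WORKERS), size=n_candidates)
    preds = [predict(c, est_speeds, est_comm) for c in cands]
    return cands[int(np.argmin(preds))]

# Optimization loop: every evaluation yields per-worker timings, i.e. many more
# observations than a single scalar objective value.
est_speeds, est_comm = np.ones(N_WORKERS), 1.0
shares = np.full(N_WORKERS, 1.0 / N_WORKERS)
for it in range(10):
    total, per_worker = true_iteration_time(shares)
    est_speeds = 0.5 * est_speeds + 0.5 * (shares * BATCH) / np.maximum(per_worker, 1e-6)
    est_comm = total - per_worker.max()
    shares = propose(est_speeds, est_comm)
    print(f"iter {it}: measured {total:.1f} ms, next shares {np.round(shares, 2)}")
```

In this toy version the structured model converges on load shares proportional to worker speed within a handful of evaluations, which mirrors the intuition behind exploiting problem structure instead of fitting a generic Gaussian process over all thirty-plus parameters.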
Abstract: Learning effective configurations in computer systems without hand-crafting models for every parameter is a long-standing problem. This paper investigates the use of deep reinforcement learning for runtime parameters of cloud databases under latency constraints. Cloud services serve up to thousands of concurrent requests per second and can adjust critical parameters by leveraging performance metrics. In this work, we use continuous deep reinforcement learning to learn optimal cache expirations for HTTP caching in content delivery networks. To this end, we introduce a technique for asynchronous experience management called delayed experience injection, which facilitates delayed reward and next-state computation in concurrent environments where measurements are not immediately available. Evaluation results show that our approach based on normalized advantage functions and asynchronous CPU-only training outperforms a statistical estimator.
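The following is a minimal, hypothetical sketch of the delayed-experience-injection idea as described in the abstract, not the authors' implementation: (state, action) pairs are parked until the corresponding measurement arrives, and only then are complete transitions injected into the replay buffer used for training (e.g. NAF updates). The class name, request IDs, and the latency-based reward are assumptions for illustration.

```python
import random
from collections import deque

class DelayedExperienceBuffer:
    """Hypothetical buffer: actions are taken before their rewards are observable."""

    def __init__(self, capacity=10_000):
        self.pending = {}                  # request_id -> (state, action), awaiting measurement
        self.replay = deque(maxlen=capacity)

    def record_action(self, request_id, state, action):
        # Called when the agent sets a parameter (e.g. a cache TTL); reward unknown yet.
        self.pending[request_id] = (state, action)

    def inject(self, request_id, reward, next_state):
        # Called later, once the delayed measurement (e.g. observed latency) arrives.
        if request_id in self.pending:
            state, action = self.pending.pop(request_id)
            self.replay.append((state, action, reward, next_state))

    def sample(self, batch_size):
        return random.sample(self.replay, min(batch_size, len(self.replay)))

# Usage example with made-up values:
buf = DelayedExperienceBuffer()
buf.record_action(request_id=1, state=[0.2, 0.9], action=30.0)   # TTL of 30 s chosen
# ... concurrent requests are served; the measurement arrives asynchronously ...
buf.inject(request_id=1, reward=-0.05, next_state=[0.3, 0.8])    # latency-based reward
batch = buf.sample(32)                                           # consumed by the learner
print(batch)
```

Separating "record" from "inject" is what lets the agent keep acting in a concurrent environment while rewards and next states trickle in out of order, which is the asynchronous experience management the abstract refers to.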