Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Progressive Batching L-BFGS Method for Machine Learning

May 30, 2018

Raghu Bollapragada, Dheevatsa Mudigere, Jorge Nocedal, Hao-Jun Michael Shi, Ping Tak Peter Tang

Figure 1 for A Progressive Batching L-BFGS Method for Machine Learning

Figure 2 for A Progressive Batching L-BFGS Method for Machine Learning

Figure 3 for A Progressive Batching L-BFGS Method for Machine Learning

Figure 4 for A Progressive Batching L-BFGS Method for Machine Learning

Share this with someone who'll enjoy it:

Abstract:The standard L-BFGS method relies on gradient approximations that are not dominated by noise, so that search directions are descent directions, the line search is reliable, and quasi-Newton updating yields useful quadratic models of the objective function. All of this appears to call for a full batch approach, but since small batch sizes give rise to faster algorithms with better generalization properties, L-BFGS is currently not considered an algorithm of choice for large-scale machine learning applications. One need not, however, choose between the two extremes represented by the full batch or highly stochastic regimes, and may instead follow a progressive batching approach in which the sample size increases during the course of the optimization. In this paper, we present a new version of the L-BFGS algorithm that combines three basic components - progressive batching, a stochastic line search, and stable quasi-Newton updating - and that performs well on training logistic regression and deep neural networks. We provide supporting convergence theory for the method.

* ICML 2018. 25 pages, 17 figures, 2 tables

View paper on

Share this with someone who'll enjoy it:

Title:A Progressive Batching L-BFGS Method for Machine Learning

Paper and Code