Picture for Maximus Mutschler

Maximus Mutschler

Using a one dimensional parabolic model of the full-batch loss to estimate learning rates during training

Add code
Aug 31, 2021
Figure 1 for Using a one dimensional parabolic model of the full-batch loss to estimate learning rates during training
Figure 2 for Using a one dimensional parabolic model of the full-batch loss to estimate learning rates during training
Figure 3 for Using a one dimensional parabolic model of the full-batch loss to estimate learning rates during training
Figure 4 for Using a one dimensional parabolic model of the full-batch loss to estimate learning rates during training
Viaarxiv icon

Empirically explaining SGD from a line search perspective

Add code
Mar 31, 2021
Figure 1 for Empirically explaining SGD from a line search perspective
Figure 2 for Empirically explaining SGD from a line search perspective
Figure 3 for Empirically explaining SGD from a line search perspective
Figure 4 for Empirically explaining SGD from a line search perspective
Viaarxiv icon

A straightforward line search approach on the expected empirical loss for stochastic deep learning problems

Add code
Oct 02, 2020
Figure 1 for A straightforward line search approach on the expected empirical loss for stochastic deep learning problems
Figure 2 for A straightforward line search approach on the expected empirical loss for stochastic deep learning problems
Figure 3 for A straightforward line search approach on the expected empirical loss for stochastic deep learning problems
Figure 4 for A straightforward line search approach on the expected empirical loss for stochastic deep learning problems
Viaarxiv icon

PAL: A fast DNN optimization method based on curvature information

Add code
Mar 28, 2019
Figure 1 for PAL: A fast DNN optimization method based on curvature information
Figure 2 for PAL: A fast DNN optimization method based on curvature information
Figure 3 for PAL: A fast DNN optimization method based on curvature information
Figure 4 for PAL: A fast DNN optimization method based on curvature information
Viaarxiv icon