Towards Theoretically Understanding Why SGD Generalizes Better Than ADAM in Deep Learning

Oct 12, 2020
Figures 1-4 for Towards Theoretically Understanding Why SGD Generalizes Better Than ADAM in Deep Learning

View paper on arXiv