Alert button
Picture for Achraf Bahamou

Achraf Bahamou

Alert button

Layer-wise Adaptive Step-Sizes for Stochastic First-Order Methods for Deep Learning

Add code
Bookmark button
Alert button
May 23, 2023
Achraf Bahamou, Donald Goldfarb

Figure 1 for Layer-wise Adaptive Step-Sizes for Stochastic First-Order Methods for Deep Learning
Figure 2 for Layer-wise Adaptive Step-Sizes for Stochastic First-Order Methods for Deep Learning
Figure 3 for Layer-wise Adaptive Step-Sizes for Stochastic First-Order Methods for Deep Learning
Figure 4 for Layer-wise Adaptive Step-Sizes for Stochastic First-Order Methods for Deep Learning
Viaarxiv icon

A Mini-Block Natural Gradient Method for Deep Neural Networks

Add code
Bookmark button
Alert button
Feb 16, 2022
Achraf Bahamou, Donald Goldfarb, Yi Ren

Figure 1 for A Mini-Block Natural Gradient Method for Deep Neural Networks
Figure 2 for A Mini-Block Natural Gradient Method for Deep Neural Networks
Figure 3 for A Mini-Block Natural Gradient Method for Deep Neural Networks
Figure 4 for A Mini-Block Natural Gradient Method for Deep Neural Networks
Viaarxiv icon

Practical Quasi-Newton Methods for Training Deep Neural Networks

Add code
Bookmark button
Alert button
Jun 16, 2020
Donald Goldfarb, Yi Ren, Achraf Bahamou

Figure 1 for Practical Quasi-Newton Methods for Training Deep Neural Networks
Figure 2 for Practical Quasi-Newton Methods for Training Deep Neural Networks
Figure 3 for Practical Quasi-Newton Methods for Training Deep Neural Networks
Figure 4 for Practical Quasi-Newton Methods for Training Deep Neural Networks
Viaarxiv icon

Stochastic Flows and Geometric Optimization on the Orthogonal Group

Add code
Bookmark button
Alert button
Mar 30, 2020
Krzysztof Choromanski, David Cheikhi, Jared Davis, Valerii Likhosherstov, Achille Nazaret, Achraf Bahamou, Xingyou Song, Mrugank Akarte, Jack Parker-Holder, Jacob Bergquist, Yuan Gao, Aldo Pacchiano, Tamas Sarlos, Adrian Weller, Vikas Sindhwani

Figure 1 for Stochastic Flows and Geometric Optimization on the Orthogonal Group
Figure 2 for Stochastic Flows and Geometric Optimization on the Orthogonal Group
Figure 3 for Stochastic Flows and Geometric Optimization on the Orthogonal Group
Figure 4 for Stochastic Flows and Geometric Optimization on the Orthogonal Group
Viaarxiv icon

A Dynamic Sampling Adaptive-SGD Method for Machine Learning

Add code
Bookmark button
Alert button
Dec 31, 2019
Achraf Bahamou, Donald Goldfarb

Figure 1 for A Dynamic Sampling Adaptive-SGD Method for Machine Learning
Figure 2 for A Dynamic Sampling Adaptive-SGD Method for Machine Learning
Figure 3 for A Dynamic Sampling Adaptive-SGD Method for Machine Learning
Figure 4 for A Dynamic Sampling Adaptive-SGD Method for Machine Learning
Viaarxiv icon