Alert button
Picture for Zachary Nado

Zachary Nado

Alert button

A Loss Curvature Perspective on Training Instability in Deep Learning

Add code
Bookmark button
Alert button
Oct 08, 2021
Justin Gilmer, Behrooz Ghorbani, Ankush Garg, Sneha Kudugunta, Behnam Neyshabur, David Cardoze, George Dahl, Zachary Nado, Orhan Firat

Figure 1 for A Loss Curvature Perspective on Training Instability in Deep Learning
Figure 2 for A Loss Curvature Perspective on Training Instability in Deep Learning
Figure 3 for A Loss Curvature Perspective on Training Instability in Deep Learning
Figure 4 for A Loss Curvature Perspective on Training Instability in Deep Learning
Viaarxiv icon

Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning

Add code
Bookmark button
Alert button
Jun 07, 2021
Zachary Nado, Neil Band, Mark Collier, Josip Djolonga, Michael W. Dusenberry, Sebastian Farquhar, Angelos Filos, Marton Havasi, Rodolphe Jenatton, Ghassen Jerfel, Jeremiah Liu, Zelda Mariet, Jeremy Nixon, Shreyas Padhy, Jie Ren, Tim G. J. Rudner, Yeming Wen, Florian Wenzel, Kevin Murphy, D. Sculley, Balaji Lakshminarayanan, Jasper Snoek, Yarin Gal, Dustin Tran

Figure 1 for Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning
Figure 2 for Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning
Figure 3 for Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning
Figure 4 for Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning
Viaarxiv icon

A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes

Add code
Bookmark button
Alert button
Feb 16, 2021
Zachary Nado, Justin M. Gilmer, Christopher J. Shallue, Rohan Anil, George E. Dahl

Figure 1 for A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes
Figure 2 for A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes
Figure 3 for A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes
Figure 4 for A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes
Viaarxiv icon

Underspecification Presents Challenges for Credibility in Modern Machine Learning

Add code
Bookmark button
Alert button
Nov 06, 2020
Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew D. Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne, Rajiv Raman, Kim Ramasamy, Rory Sayres, Jessica Schrouff, Martin Seneviratne, Shannon Sequeira, Harini Suresh, Victor Veitch, Max Vladymyrov, Xuezhi Wang, Kellie Webster, Steve Yadlowsky, Taedong Yun, Xiaohua Zhai, D. Sculley

Figure 1 for Underspecification Presents Challenges for Credibility in Modern Machine Learning
Figure 2 for Underspecification Presents Challenges for Credibility in Modern Machine Learning
Figure 3 for Underspecification Presents Challenges for Credibility in Modern Machine Learning
Figure 4 for Underspecification Presents Challenges for Credibility in Modern Machine Learning
Viaarxiv icon

Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift

Add code
Bookmark button
Alert button
Jul 17, 2020
Zachary Nado, Shreyas Padhy, D. Sculley, Alexander D'Amour, Balaji Lakshminarayanan, Jasper Snoek

Figure 1 for Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift
Figure 2 for Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift
Figure 3 for Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift
Figure 4 for Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift
Viaarxiv icon

Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks

Add code
Bookmark button
Alert button
Jul 10, 2020
Shreyas Padhy, Zachary Nado, Jie Ren, Jeremiah Liu, Jasper Snoek, Balaji Lakshminarayanan

Figure 1 for Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks
Figure 2 for Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks
Figure 3 for Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks
Figure 4 for Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks
Viaarxiv icon

On Empirical Comparisons of Optimizers for Deep Learning

Add code
Bookmark button
Alert button
Oct 11, 2019
Dami Choi, Christopher J. Shallue, Zachary Nado, Jaehoon Lee, Chris J. Maddison, George E. Dahl

Figure 1 for On Empirical Comparisons of Optimizers for Deep Learning
Figure 2 for On Empirical Comparisons of Optimizers for Deep Learning
Figure 3 for On Empirical Comparisons of Optimizers for Deep Learning
Figure 4 for On Empirical Comparisons of Optimizers for Deep Learning
Viaarxiv icon

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model

Add code
Bookmark button
Alert button
Jul 09, 2019
Guodong Zhang, Lala Li, Zachary Nado, James Martens, Sushant Sachdeva, George E. Dahl, Christopher J. Shallue, Roger Grosse

Figure 1 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 2 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 3 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 4 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Viaarxiv icon