Shankar Krishnan

Benchmarking Neural Network Training Algorithms

Jun 12, 2023
George E. Dahl, Frank Schneider, Zachary Nado, Naman Agarwal, Chandramouli Shama Sastry, Philipp Hennig, Sourabh Medapati, Runa Eschenhagen, Priya Kasimbeg, Daniel Suo, Juhan Bae, Justin Gilmer, Abel L. Peirson, Bilal Khan, Rohan Anil, Mike Rabbat, Shankar Krishnan, Daniel Snider, Ehsan Amid, Kongtao Chen, Chris J. Maddison, Rakshith Vasudev, Michal Badura, Ankush Garg, Peter Mattson

Adaptive Gradient Methods at the Edge of Stability

Jul 29, 2022
Jeremy M. Cohen, Behrooz Ghorbani, Shankar Krishnan, Naman Agarwal, Sourabh Medapati, Michal Badura, Daniel Suo, David Cardoze, Zachary Nado, George E. Dahl, Justin Gilmer

A Unifying View on Implicit Bias in Training Linear Neural Networks

Oct 06, 2020
Chulhee Yun, Shankar Krishnan, Hossein Mobahi

Explaining Memorization and Generalization: A Large-Scale Study with Coherent Gradients

Mar 16, 2020
Piotr Zielinski, Shankar Krishnan, Satrajit Chatterjee

Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Networks

Nov 21, 2019
Saurabh Singh, Shankar Krishnan

An Investigation into Neural Net Optimization via Hessian Eigenvalue Density

Jan 29, 2019
Behrooz Ghorbani, Shankar Krishnan, Ying Xiao

Neumann Optimizer: A Practical Optimization Algorithm for Deep Neural Networks

Dec 08, 2017
Shankar Krishnan, Ying Xiao, Rif A. Saurous

Achieving Approximate Soft Clustering in Data Streams

Jul 26, 2012
Vaneet Aggarwal, Shankar Krishnan
