Alert button
Picture for Roger Grosse

Roger Grosse

Alert button

When Does Preconditioning Help or Hurt Generalization?

Add code
Bookmark button
Alert button
Jun 25, 2020
Shun-ichi Amari, Jimmy Ba, Roger Grosse, Xuechen Li, Atsushi Nitanda, Taiji Suzuki, Denny Wu, Ji Xu

Figure 1 for When Does Preconditioning Help or Hurt Generalization?
Figure 2 for When Does Preconditioning Help or Hurt Generalization?
Figure 3 for When Does Preconditioning Help or Hurt Generalization?
Figure 4 for When Does Preconditioning Help or Hurt Generalization?
Viaarxiv icon

Understanding and mitigating exploding inverses in invertible neural networks

Add code
Bookmark button
Alert button
Jun 16, 2020
Jens Behrmann, Paul Vicol, Kuan-Chieh Wang, Roger Grosse, Jörn-Henrik Jacobsen

Figure 1 for Understanding and mitigating exploding inverses in invertible neural networks
Figure 2 for Understanding and mitigating exploding inverses in invertible neural networks
Figure 3 for Understanding and mitigating exploding inverses in invertible neural networks
Figure 4 for Understanding and mitigating exploding inverses in invertible neural networks
Viaarxiv icon

Picking Winning Tickets Before Training by Preserving Gradient Flow

Add code
Bookmark button
Alert button
Feb 18, 2020
Chaoqi Wang, Guodong Zhang, Roger Grosse

Figure 1 for Picking Winning Tickets Before Training by Preserving Gradient Flow
Figure 2 for Picking Winning Tickets Before Training by Preserving Gradient Flow
Figure 3 for Picking Winning Tickets Before Training by Preserving Gradient Flow
Figure 4 for Picking Winning Tickets Before Training by Preserving Gradient Flow
Viaarxiv icon

Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks

Add code
Bookmark button
Alert button
Nov 09, 2019
Qiyang Li, Saminul Haque, Cem Anil, James Lucas, Roger Grosse, Jörn-Henrik Jacobsen

Figure 1 for Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
Figure 2 for Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
Figure 3 for Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
Figure 4 for Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
Viaarxiv icon

Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse

Add code
Bookmark button
Alert button
Nov 06, 2019
James Lucas, George Tucker, Roger Grosse, Mohammad Norouzi

Figure 1 for Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse
Figure 2 for Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse
Figure 3 for Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse
Figure 4 for Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse
Viaarxiv icon

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model

Add code
Bookmark button
Alert button
Jul 09, 2019
Guodong Zhang, Lala Li, Zachary Nado, James Martens, Sushant Sachdeva, George E. Dahl, Christopher J. Shallue, Roger Grosse

Figure 1 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 2 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 3 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 4 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Viaarxiv icon

Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks

Add code
Bookmark button
Alert button
May 27, 2019
Guodong Zhang, James Martens, Roger Grosse

Figure 1 for Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks
Viaarxiv icon

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis

Add code
Bookmark button
Alert button
May 15, 2019
Chaoqi Wang, Roger Grosse, Sanja Fidler, Guodong Zhang

Figure 1 for EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis
Figure 2 for EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis
Viaarxiv icon