Roger Grosse

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model

Jul 09, 2019

Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks

May 27, 2019

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis

May 15, 2019

Functional Variational Bayesian Neural Networks

Mar 14, 2019

Self-Tuning Networks: Bilevel Optimization of Hyperparameters using Structured Best-Response Functions

Mar 07, 2019

Eigenvalue Corrected Noisy Natural Gradient

Nov 30, 2018

Sorting out Lipschitz function approximation

Nov 13, 2018

Three Mechanisms of Weight Decay Regularization

Oct 29, 2018

Reversible Recurrent Neural Networks

Oct 25, 2018

Isolating Sources of Disentanglement in Variational Autoencoders

Oct 22, 2018