Picture for Suriya Gunasekar

Suriya Gunasekar

Microsoft Research

Generalization to translation shifts: a study in architectures and augmentations

Add code
Jul 05, 2022
Figure 1 for Generalization to translation shifts: a study in architectures and augmentations
Figure 2 for Generalization to translation shifts: a study in architectures and augmentations
Figure 3 for Generalization to translation shifts: a study in architectures and augmentations
Figure 4 for Generalization to translation shifts: a study in architectures and augmentations
Viaarxiv icon

Unveiling Transformers with LEGO: a synthetic reasoning task

Add code
Jun 09, 2022
Figure 1 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 2 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 3 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 4 for Unveiling Transformers with LEGO: a synthetic reasoning task
Viaarxiv icon

Data Augmentation as Feature Manipulation: a story of desert cows and grass cows

Add code
Mar 03, 2022
Figure 1 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 2 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 3 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 4 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Viaarxiv icon

Inductive Bias of Multi-Channel Linear Convolutional Networks with Bounded Weight Norm

Add code
Feb 24, 2021
Figure 1 for Inductive Bias of Multi-Channel Linear Convolutional Networks with Bounded Weight Norm
Figure 2 for Inductive Bias of Multi-Channel Linear Convolutional Networks with Bounded Weight Norm
Figure 3 for Inductive Bias of Multi-Channel Linear Convolutional Networks with Bounded Weight Norm
Figure 4 for Inductive Bias of Multi-Channel Linear Convolutional Networks with Bounded Weight Norm
Viaarxiv icon

NeurIPS 2020 Competition: Predicting Generalization in Deep Learning

Add code
Dec 14, 2020
Figure 1 for NeurIPS 2020 Competition: Predicting Generalization in Deep Learning
Viaarxiv icon

Implicit Bias in Deep Linear Classification: Initialization Scale vs Training Accuracy

Add code
Jul 13, 2020
Figure 1 for Implicit Bias in Deep Linear Classification: Initialization Scale vs Training Accuracy
Figure 2 for Implicit Bias in Deep Linear Classification: Initialization Scale vs Training Accuracy
Figure 3 for Implicit Bias in Deep Linear Classification: Initialization Scale vs Training Accuracy
Figure 4 for Implicit Bias in Deep Linear Classification: Initialization Scale vs Training Accuracy
Viaarxiv icon

Mirrorless Mirror Descent: A More Natural Discretization of Riemannian Gradient Flow

Add code
Apr 24, 2020
Viaarxiv icon

Kernel and Rich Regimes in Overparametrized Models

Add code
Feb 24, 2020
Figure 1 for Kernel and Rich Regimes in Overparametrized Models
Figure 2 for Kernel and Rich Regimes in Overparametrized Models
Figure 3 for Kernel and Rich Regimes in Overparametrized Models
Figure 4 for Kernel and Rich Regimes in Overparametrized Models
Viaarxiv icon

Implicit Regularization of Normalization Methods

Add code
Nov 23, 2019
Figure 1 for Implicit Regularization of Normalization Methods
Figure 2 for Implicit Regularization of Normalization Methods
Figure 3 for Implicit Regularization of Normalization Methods
Figure 4 for Implicit Regularization of Normalization Methods
Viaarxiv icon

Kernel and Deep Regimes in Overparametrized Models

Add code
Jun 13, 2019
Figure 1 for Kernel and Deep Regimes in Overparametrized Models
Figure 2 for Kernel and Deep Regimes in Overparametrized Models
Figure 3 for Kernel and Deep Regimes in Overparametrized Models
Figure 4 for Kernel and Deep Regimes in Overparametrized Models
Viaarxiv icon