Picture for Ohad Shamir

Ohad Shamir

The Complexity of Finding Stationary Points with Stochastic Gradient Descent

Add code
Oct 04, 2019
Figure 1 for The Complexity of Finding Stationary Points with Stochastic Gradient Descent
Figure 2 for The Complexity of Finding Stationary Points with Stochastic Gradient Descent
Viaarxiv icon

How Good is SGD with Random Shuffling?

Add code
Jul 31, 2019
Figure 1 for How Good is SGD with Random Shuffling?
Viaarxiv icon

Depth Separations in Neural Networks: What is Actually Being Separated?

Add code
May 26, 2019
Figure 1 for Depth Separations in Neural Networks: What is Actually Being Separated?
Figure 2 for Depth Separations in Neural Networks: What is Actually Being Separated?
Viaarxiv icon

On the Power and Limitations of Random Features for Understanding Neural Networks

Add code
Apr 01, 2019
Viaarxiv icon

Space lower bounds for linear prediction

Add code
Feb 23, 2019
Viaarxiv icon

The Complexity of Making the Gradient Small in Stochastic Convex Optimization

Add code
Feb 14, 2019
Figure 1 for The Complexity of Making the Gradient Small in Stochastic Convex Optimization
Viaarxiv icon

Global Non-convex Optimization with Discretized Diffusions

Add code
Oct 29, 2018
Figure 1 for Global Non-convex Optimization with Discretized Diffusions
Viaarxiv icon

Are ResNets Provably Better than Linear Predictors?

Add code
Sep 27, 2018
Figure 1 for Are ResNets Provably Better than Linear Predictors?
Viaarxiv icon

Exponential Convergence Time of Gradient Descent for One-Dimensional Deep Linear Neural Networks

Add code
Sep 27, 2018
Figure 1 for Exponential Convergence Time of Gradient Descent for One-Dimensional Deep Linear Neural Networks
Figure 2 for Exponential Convergence Time of Gradient Descent for One-Dimensional Deep Linear Neural Networks
Viaarxiv icon

Spurious Local Minima are Common in Two-Layer ReLU Neural Networks

Add code
Aug 09, 2018
Figure 1 for Spurious Local Minima are Common in Two-Layer ReLU Neural Networks
Figure 2 for Spurious Local Minima are Common in Two-Layer ReLU Neural Networks
Figure 3 for Spurious Local Minima are Common in Two-Layer ReLU Neural Networks
Viaarxiv icon