Ohad Shamir

The Connection Between Approximation, Depth Separation and Learnability in Neural Networks

Jan 31, 2021

Implicit Regularization in ReLU Networks with the Square Loss

Dec 15, 2020

High-Order Oracle Complexity of Smooth and Strongly Convex Optimization

Oct 13, 2020

Gradient Methods Never Overfit On Separable Data

Jun 30, 2020

Neural Networks with Small Weights and Depth-Separation Barriers

Jun 03, 2020

The Effects of Mild Over-parameterization on the Optimization Landscape of Shallow ReLU Neural Networks

Jun 01, 2020

Can We Find Near-Approximately-Stationary Points of Nonsmooth Nonconvex Functions?

Mar 14, 2020

Is Local SGD Better than Minibatch SGD?

Feb 18, 2020

Learning a Single Neuron with Gradient Methods

Feb 11, 2020

Proving the Lottery Ticket Hypothesis: Pruning is All You Need

Feb 03, 2020