Picture for Sham M. Kakade

Sham M. Kakade

Stochastic Gradient Descent Escapes Saddle Points Efficiently

Add code
Feb 13, 2019
Figure 1 for Stochastic Gradient Descent Escapes Saddle Points Efficiently
Figure 2 for Stochastic Gradient Descent Escapes Saddle Points Efficiently
Figure 3 for Stochastic Gradient Descent Escapes Saddle Points Efficiently
Figure 4 for Stochastic Gradient Descent Escapes Saddle Points Efficiently
Viaarxiv icon

Maximum Likelihood Estimation for Learning Populations of Parameters

Add code
Feb 12, 2019
Figure 1 for Maximum Likelihood Estimation for Learning Populations of Parameters
Figure 2 for Maximum Likelihood Estimation for Learning Populations of Parameters
Figure 3 for Maximum Likelihood Estimation for Learning Populations of Parameters
Figure 4 for Maximum Likelihood Estimation for Learning Populations of Parameters
Viaarxiv icon

A Short Note on Concentration Inequalities for Random Vectors with SubGaussian Norm

Add code
Feb 11, 2019
Viaarxiv icon

A Smoother Way to Train Structured Prediction Models

Add code
Feb 08, 2019
Figure 1 for A Smoother Way to Train Structured Prediction Models
Figure 2 for A Smoother Way to Train Structured Prediction Models
Figure 3 for A Smoother Way to Train Structured Prediction Models
Figure 4 for A Smoother Way to Train Structured Prediction Models
Viaarxiv icon

Provably Efficient Maximum Entropy Exploration

Add code
Dec 06, 2018
Figure 1 for Provably Efficient Maximum Entropy Exploration
Figure 2 for Provably Efficient Maximum Entropy Exploration
Figure 3 for Provably Efficient Maximum Entropy Exploration
Viaarxiv icon

Coupled Recurrent Models for Polyphonic Music Composition

Add code
Nov 20, 2018
Figure 1 for Coupled Recurrent Models for Polyphonic Music Composition
Figure 2 for Coupled Recurrent Models for Polyphonic Music Composition
Figure 3 for Coupled Recurrent Models for Polyphonic Music Composition
Figure 4 for Coupled Recurrent Models for Polyphonic Music Composition
Viaarxiv icon

Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator

Add code
Oct 21, 2018
Figure 1 for Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Viaarxiv icon

On the insufficiency of existing momentum schemes for Stochastic Optimization

Add code
Jul 31, 2018
Figure 1 for On the insufficiency of existing momentum schemes for Stochastic Optimization
Figure 2 for On the insufficiency of existing momentum schemes for Stochastic Optimization
Figure 3 for On the insufficiency of existing momentum schemes for Stochastic Optimization
Figure 4 for On the insufficiency of existing momentum schemes for Stochastic Optimization
Viaarxiv icon

Accelerating Stochastic Gradient Descent For Least Squares Regression

Add code
Jul 31, 2018
Figure 1 for Accelerating Stochastic Gradient Descent For Least Squares Regression
Figure 2 for Accelerating Stochastic Gradient Descent For Least Squares Regression
Figure 3 for Accelerating Stochastic Gradient Descent For Least Squares Regression
Figure 4 for Accelerating Stochastic Gradient Descent For Least Squares Regression
Viaarxiv icon

Parallelizing Stochastic Gradient Descent for Least Squares Regression: mini-batching, averaging, and model misspecification

Add code
Jul 31, 2018
Figure 1 for Parallelizing Stochastic Gradient Descent for Least Squares Regression: mini-batching, averaging, and model misspecification
Figure 2 for Parallelizing Stochastic Gradient Descent for Least Squares Regression: mini-batching, averaging, and model misspecification
Figure 3 for Parallelizing Stochastic Gradient Descent for Least Squares Regression: mini-batching, averaging, and model misspecification
Figure 4 for Parallelizing Stochastic Gradient Descent for Least Squares Regression: mini-batching, averaging, and model misspecification
Viaarxiv icon