Sham M. Kakade

Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes

Aug 29, 2019
Alekh Agarwal, Sham M. Kakade, Jason D. Lee, Gaurav Mahajan

Via arXiv

Calibration, Entropy Rates, and Memory in Language Models

Jun 11, 2019
Mark Braverman, Xinyi Chen, Sham M. Kakade, Karthik Narasimhan, Cyril Zhang, Yi Zhang

Via arXiv

The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure

Apr 29, 2019
Rong Ge, Sham M. Kakade, Rahul Kidambi, Praneeth Netrapalli

Via arXiv

Online Control with Adversarial Disturbances

Feb 23, 2019
Naman Agarwal, Brian Bullins, Elad Hazan, Sham M. Kakade, Karan Singh

Via arXiv

Stochastic Gradient Descent Escapes Saddle Points Efficiently

Feb 13, 2019
Chi Jin, Praneeth Netrapalli, Rong Ge, Sham M. Kakade, Michael I. Jordan

Via arXiv

Maximum Likelihood Estimation for Learning Populations of Parameters

Feb 12, 2019
Ramya Korlakai Vinayak, Weihao Kong, Gregory Valiant, Sham M. Kakade

Via arXiv

A Short Note on Concentration Inequalities for Random Vectors with SubGaussian Norm

Feb 11, 2019
Chi Jin, Praneeth Netrapalli, Rong Ge, Sham M. Kakade, Michael I. Jordan

Via arXiv

A Smoother Way to Train Structured Prediction Models

Feb 08, 2019
Krishna Pillutla, Vincent Roulet, Sham M. Kakade, Zaid Harchaoui

Via arXiv

Provably Efficient Maximum Entropy Exploration

Dec 06, 2018
Elad Hazan, Sham M. Kakade, Karan Singh, Abby Van Soest

Via arXiv