Picture for Sham M. Kakade

Sham M. Kakade

Is Long Horizon Reinforcement Learning More Difficult Than Short Horizon Reinforcement Learning?

Add code
May 01, 2020
Viaarxiv icon

Few-Shot Learning via Learning the Representation, Provably

Add code
Feb 21, 2020
Viaarxiv icon

The Nonstochastic Control Problem

Add code
Jan 20, 2020
Viaarxiv icon

Robust Aggregation for Federated Learning

Add code
Dec 31, 2019
Figure 1 for Robust Aggregation for Federated Learning
Figure 2 for Robust Aggregation for Federated Learning
Figure 3 for Robust Aggregation for Federated Learning
Figure 4 for Robust Aggregation for Federated Learning
Viaarxiv icon

Optimal Estimation of Change in a Population of Parameters

Add code
Nov 28, 2019
Figure 1 for Optimal Estimation of Change in a Population of Parameters
Figure 2 for Optimal Estimation of Change in a Population of Parameters
Figure 3 for Optimal Estimation of Change in a Population of Parameters
Figure 4 for Optimal Estimation of Change in a Population of Parameters
Viaarxiv icon

Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?

Add code
Nov 03, 2019
Figure 1 for Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
Figure 2 for Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
Figure 3 for Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
Figure 4 for Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
Viaarxiv icon

Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes

Add code
Aug 29, 2019
Figure 1 for Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes
Figure 2 for Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes
Figure 3 for Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes
Figure 4 for Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes
Viaarxiv icon

Calibration, Entropy Rates, and Memory in Language Models

Add code
Jun 11, 2019
Figure 1 for Calibration, Entropy Rates, and Memory in Language Models
Figure 2 for Calibration, Entropy Rates, and Memory in Language Models
Figure 3 for Calibration, Entropy Rates, and Memory in Language Models
Figure 4 for Calibration, Entropy Rates, and Memory in Language Models
Viaarxiv icon

The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure

Add code
Apr 29, 2019
Figure 1 for The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure
Figure 2 for The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure
Figure 3 for The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure
Figure 4 for The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure
Viaarxiv icon

Online Control with Adversarial Disturbances

Add code
Feb 23, 2019
Viaarxiv icon