Sham M. Kakade

Going Beyond Linear RL: Sample Efficient Neural Function Approximation

Jul 14, 2021
Via arXiv

Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

Jul 09, 2021
Via arXiv

A Short Note on the Relationship of Information Gain and Eluder Dimension

Jul 06, 2021
Via arXiv

Benign Overfitting of Constant-Stepsize SGD for Linear Regression

Mar 23, 2021
Via arXiv

An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap

Mar 23, 2021
Via arXiv

Bilinear Classes: A Structural Framework for Provable Generalization in RL

Mar 19, 2021
Via arXiv

Instabilities of Offline RL with Pre-Trained Neural Representation

Mar 08, 2021
Via arXiv

What are the Statistical Limits of Offline RL with Linear Function Approximation?

Oct 22, 2020
Via arXiv

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity

Jul 15, 2020
Via arXiv

Sample-Efficient Reinforcement Learning of Undercomplete POMDPs

Jun 22, 2020
Via arXiv