Alert button
Picture for Sham M. Kakade

Sham M. Kakade

Alert button

The Benefits of Implicit Regularization from SGD in Least Squares Problems

Add code
Bookmark button
Alert button
Aug 10, 2021
Difan Zou, Jingfeng Wu, Vladimir Braverman, Quanquan Gu, Dean P. Foster, Sham M. Kakade

Figure 1 for The Benefits of Implicit Regularization from SGD in Least Squares Problems
Viaarxiv icon

Going Beyond Linear RL: Sample Efficient Neural Function Approximation

Add code
Bookmark button
Alert button
Jul 14, 2021
Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

Figure 1 for Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Figure 2 for Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Viaarxiv icon

Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

Add code
Bookmark button
Alert button
Jul 09, 2021
Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

Figure 1 for Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
Figure 2 for Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
Viaarxiv icon

A Short Note on the Relationship of Information Gain and Eluder Dimension

Add code
Bookmark button
Alert button
Jul 06, 2021
Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei

Viaarxiv icon

Benign Overfitting of Constant-Stepsize SGD for Linear Regression

Add code
Bookmark button
Alert button
Mar 23, 2021
Difan Zou, Jingfeng Wu, Vladimir Braverman, Quanquan Gu, Sham M. Kakade

Figure 1 for Benign Overfitting of Constant-Stepsize SGD for Linear Regression
Figure 2 for Benign Overfitting of Constant-Stepsize SGD for Linear Regression
Viaarxiv icon

An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap

Add code
Bookmark button
Alert button
Mar 23, 2021
Yuanhao Wang, Ruosong Wang, Sham M. Kakade

Figure 1 for An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
Figure 2 for An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
Viaarxiv icon

Bilinear Classes: A Structural Framework for Provable Generalization in RL

Add code
Bookmark button
Alert button
Mar 19, 2021
Simon S. Du, Sham M. Kakade, Jason D. Lee, Shachar Lovett, Gaurav Mahajan, Wen Sun, Ruosong Wang

Figure 1 for Bilinear Classes: A Structural Framework for Provable Generalization in RL
Figure 2 for Bilinear Classes: A Structural Framework for Provable Generalization in RL
Viaarxiv icon

Instabilities of Offline RL with Pre-Trained Neural Representation

Add code
Bookmark button
Alert button
Mar 08, 2021
Ruosong Wang, Yifan Wu, Ruslan Salakhutdinov, Sham M. Kakade

Figure 1 for Instabilities of Offline RL with Pre-Trained Neural Representation
Figure 2 for Instabilities of Offline RL with Pre-Trained Neural Representation
Figure 3 for Instabilities of Offline RL with Pre-Trained Neural Representation
Figure 4 for Instabilities of Offline RL with Pre-Trained Neural Representation
Viaarxiv icon

What are the Statistical Limits of Offline RL with Linear Function Approximation?

Add code
Bookmark button
Alert button
Oct 22, 2020
Ruosong Wang, Dean P. Foster, Sham M. Kakade

Figure 1 for What are the Statistical Limits of Offline RL with Linear Function Approximation?
Figure 2 for What are the Statistical Limits of Offline RL with Linear Function Approximation?
Viaarxiv icon

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity

Add code
Bookmark button
Alert button
Jul 15, 2020
Kaiqing Zhang, Sham M. Kakade, Tamer Başar, Lin F. Yang

Figure 1 for Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Viaarxiv icon