Alert button
Picture for Simon S. Du

Simon S. Du

Alert button

On the Power of Multitask Representation Learning in Linear MDP

Add code
Bookmark button
Alert button
Jun 15, 2021
Rui Lu, Gao Huang, Simon S. Du

Figure 1 for On the Power of Multitask Representation Learning in Linear MDP
Figure 2 for On the Power of Multitask Representation Learning in Linear MDP
Viaarxiv icon

Provable Adaptation across Multiway Domains via Representation Learning

Add code
Bookmark button
Alert button
Jun 12, 2021
Zhili Feng, Shaobo Han, Simon S. Du

Figure 1 for Provable Adaptation across Multiway Domains via Representation Learning
Figure 2 for Provable Adaptation across Multiway Domains via Representation Learning
Figure 3 for Provable Adaptation across Multiway Domains via Representation Learning
Figure 4 for Provable Adaptation across Multiway Domains via Representation Learning
Viaarxiv icon

Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

Add code
Bookmark button
Alert button
Apr 22, 2021
Jean Tarbouriech, Runlong Zhou, Simon S. Du, Matteo Pirotta, Michal Valko, Alessandro Lazaric

Figure 1 for Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret
Figure 2 for Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret
Viaarxiv icon

Nearly Horizon-Free Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 25, 2021
Tongzheng Ren, Jialian Li, Bo Dai, Simon S. Du, Sujay Sanghavi

Figure 1 for Nearly Horizon-Free Offline Reinforcement Learning
Figure 2 for Nearly Horizon-Free Offline Reinforcement Learning
Viaarxiv icon

Bilinear Classes: A Structural Framework for Provable Generalization in RL

Add code
Bookmark button
Alert button
Mar 19, 2021
Simon S. Du, Sham M. Kakade, Jason D. Lee, Shachar Lovett, Gaurav Mahajan, Wen Sun, Ruosong Wang

Figure 1 for Bilinear Classes: A Structural Framework for Provable Generalization in RL
Figure 2 for Bilinear Classes: A Structural Framework for Provable Generalization in RL
Viaarxiv icon

Improved Corruption Robust Algorithms for Episodic Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 08, 2021
Yifang Chen, Simon S. Du, Kevin Jamieson

Viaarxiv icon

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP

Add code
Bookmark button
Alert button
Feb 19, 2021
Zihan Zhang, Jiaqi Yang, Xiangyang Ji, Simon S. Du

Viaarxiv icon

Randomized Exploration is Near-Optimal for Tabular MDP

Add code
Bookmark button
Alert button
Feb 19, 2021
Zhihan Xiong, Ruoqi Shen, Simon S. Du

Figure 1 for Randomized Exploration is Near-Optimal for Tabular MDP
Figure 2 for Randomized Exploration is Near-Optimal for Tabular MDP
Figure 3 for Randomized Exploration is Near-Optimal for Tabular MDP
Viaarxiv icon

Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games

Add code
Bookmark button
Alert button
Feb 17, 2021
Yulai Zhao, Yuandong Tian, Jason D. Lee, Simon S. Du

Viaarxiv icon