
Zhuoran Yang

SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning

Oct 24, 2021
Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Zhaoran Wang, Jing Jiang


On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

Oct 19, 2021
Shuang Qiu, Jieping Ye, Zhaoran Wang, Zhuoran Yang


Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs

Oct 18, 2021
Han Zhong, Zhuoran Yang, Zhaoran Wang, Csaba Szepesvári


Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima

Oct 12, 2021
Boyi Liu, Jiayang Li, Zhuoran Yang, Hoi-To Wai, Mingyi Hong, Yu Marco Nie, Zhaoran Wang


Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation

Aug 19, 2021
Zhihan Liu, Yufeng Zhang, Zuyue Fu, Zhuoran Yang, Zhaoran Wang


Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning

Aug 08, 2021
Pratik Ramprasad, Yuantong Li, Zhuoran Yang, Zhaoran Wang, Will Wei Sun, Guang Cheng


Towards General Function Approximation in Zero-Sum Markov Games

Jul 30, 2021
Baihe Huang, Jason D. Lee, Zhaoran Wang, Zhuoran Yang


A Unified Off-Policy Evaluation Approach for General Value Function

Jul 06, 2021
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang


Gap-Dependent Bounds for Two-Player Markov Games

Jul 01, 2021
Zehao Dou, Zhuoran Yang, Zhaoran Wang, Simon S. Du


Randomized Exploration for Reinforcement Learning with General Value Function Approximation

Jun 15, 2021
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin F. Yang
