Alert button
Picture for Zhaoran Wang

Zhaoran Wang

Alert button

Differentiable Bilevel Programming for Stackelberg Congestion Games

Add code
Bookmark button
Alert button
Sep 15, 2022
Jiayang Li, Jing Yu, Qianni Wang, Boyi Liu, Zhaoran Wang, Yu Marco Nie

Figure 1 for Differentiable Bilevel Programming for Stackelberg Congestion Games
Figure 2 for Differentiable Bilevel Programming for Stackelberg Congestion Games
Figure 3 for Differentiable Bilevel Programming for Stackelberg Congestion Games
Figure 4 for Differentiable Bilevel Programming for Stackelberg Congestion Games
Viaarxiv icon

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 29, 2022
Shuang Qiu, Lingxiao Wang, Chenjia Bai, Zhuoran Yang, Zhaoran Wang

Figure 1 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 2 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 3 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 4 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Viaarxiv icon

Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions

Add code
Bookmark button
Alert button
Jul 25, 2022
Shuang Qiu, Xiaohan Wei, Jieping Ye, Zhaoran Wang, Zhuoran Yang

Figure 1 for Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
Figure 2 for Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
Viaarxiv icon

Federated Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 11, 2022
Doudou Zhou, Yufeng Zhang, Aaron Sonabend-W, Zhaoran Wang, Junwei Lu, Tianxi Cai

Figure 1 for Federated Offline Reinforcement Learning
Figure 2 for Federated Offline Reinforcement Learning
Figure 3 for Federated Offline Reinforcement Learning
Viaarxiv icon

RORL: Robust Offline Reinforcement Learning via Conservative Smoothing

Add code
Bookmark button
Alert button
Jun 06, 2022
Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han

Figure 1 for RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Figure 2 for RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Figure 3 for RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Figure 4 for RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Viaarxiv icon

Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes

Add code
Bookmark button
Alert button
May 26, 2022
Miao Lu, Yifei Min, Zhaoran Wang, Zhuoran Yang

Figure 1 for Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Figure 2 for Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Figure 3 for Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Figure 4 for Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Viaarxiv icon

Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency

Add code
Bookmark button
Alert button
May 26, 2022
Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

Figure 1 for Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
Viaarxiv icon

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

Add code
Bookmark button
Alert button
May 24, 2022
Xiaoyu Chen, Han Zhong, Zhuoran Yang, Zhaoran Wang, Liwei Wang

Viaarxiv icon

Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning

Add code
Bookmark button
Alert button
May 05, 2022
Boxiang Lyu, Zhaoran Wang, Mladen Kolar, Zhuoran Yang

Viaarxiv icon

Sample-Efficient Reinforcement Learning for POMDPs with Linear Function Approximations

Add code
Bookmark button
Alert button
Apr 20, 2022
Qi Cai, Zhuoran Yang, Zhaoran Wang

Figure 1 for Sample-Efficient Reinforcement Learning for POMDPs with Linear Function Approximations
Figure 2 for Sample-Efficient Reinforcement Learning for POMDPs with Linear Function Approximations
Viaarxiv icon