Alert button
Picture for Zhuoran Yang

Zhuoran Yang

Alert button

Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality

Add code
Bookmark button
Alert button
Feb 27, 2021
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang

Figure 1 for Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Figure 2 for Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Figure 3 for Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Viaarxiv icon

Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 19, 2021
Luofeng Liao, Zuyue Fu, Zhuoran Yang, Mladen Kolar, Zhaoran Wang

Figure 1 for Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Figure 2 for Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Viaarxiv icon

A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization

Add code
Bookmark button
Alert button
Feb 15, 2021
Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang

Figure 1 for A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization
Figure 2 for A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization
Figure 3 for A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization
Figure 4 for A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization
Viaarxiv icon

Is Pessimism Provably Efficient for Offline RL?

Add code
Bookmark button
Alert button
Dec 30, 2020
Ying Jin, Zhuoran Yang, Zhaoran Wang

Figure 1 for Is Pessimism Provably Efficient for Offline RL?
Figure 2 for Is Pessimism Provably Efficient for Offline RL?
Figure 3 for Is Pessimism Provably Efficient for Offline RL?
Viaarxiv icon

Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy

Add code
Bookmark button
Alert button
Dec 28, 2020
Han Zhong, Ethan X. Fang, Zhuoran Yang, Zhaoran Wang

Viaarxiv icon

Variational Transport: A Convergent Particle-BasedAlgorithm for Distributional Optimization

Add code
Bookmark button
Alert button
Dec 21, 2020
Zhuoran Yang, Yufeng Zhang, Yongxin Chen, Zhaoran Wang

Viaarxiv icon

Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations

Add code
Bookmark button
Alert button
Nov 09, 2020
Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael I. Jordan

Figure 1 for Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations
Viaarxiv icon

Provable Fictitious Play for General Mean-Field Games

Add code
Bookmark button
Alert button
Oct 08, 2020
Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, Andreea Minca

Viaarxiv icon

Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning

Add code
Bookmark button
Alert button
Aug 23, 2020
Shuang Qiu, Zhuoran Yang, Xiaohan Wei, Jieping Ye, Zhaoran Wang

Figure 1 for Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning
Viaarxiv icon