Alert button
Picture for Zhaoran Wang

Zhaoran Wang

Alert button

Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy

Add code
Bookmark button
Alert button
Dec 28, 2020
Han Zhong, Ethan X. Fang, Zhuoran Yang, Zhaoran Wang

Viaarxiv icon

Variational Transport: A Convergent Particle-BasedAlgorithm for Distributional Optimization

Add code
Bookmark button
Alert button
Dec 21, 2020
Zhuoran Yang, Yufeng Zhang, Yongxin Chen, Zhaoran Wang

Viaarxiv icon

Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations

Add code
Bookmark button
Alert button
Nov 09, 2020
Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael I. Jordan

Figure 1 for Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations
Viaarxiv icon

End-to-End Learning and Intervention in Games

Add code
Bookmark button
Alert button
Oct 26, 2020
Jiayang Li, Jing Yu, Yu Marco Nie, Zhaoran Wang

Figure 1 for End-to-End Learning and Intervention in Games
Figure 2 for End-to-End Learning and Intervention in Games
Figure 3 for End-to-End Learning and Intervention in Games
Figure 4 for End-to-End Learning and Intervention in Games
Viaarxiv icon

Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 17, 2020
Chenjia Bai, Peng Liu, Zhaoran Wang, Kaiyu Liu, Lingxiao Wang, Yingnan Zhao

Figure 1 for Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
Figure 2 for Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
Figure 3 for Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
Figure 4 for Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
Viaarxiv icon

Provable Fictitious Play for General Mean-Field Games

Add code
Bookmark button
Alert button
Oct 08, 2020
Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, Andreea Minca

Viaarxiv icon

Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection

Add code
Bookmark button
Alert button
Sep 04, 2020
Yining Wang, Yi Chen, Ethan X. Fang, Zhaoran Wang, Runze Li

Figure 1 for Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection
Figure 2 for Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection
Figure 3 for Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection
Figure 4 for Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection
Viaarxiv icon

Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning

Add code
Bookmark button
Alert button
Aug 23, 2020
Shuang Qiu, Zhuoran Yang, Xiaohan Wei, Jieping Ye, Zhaoran Wang

Figure 1 for Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning
Viaarxiv icon

Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time

Add code
Bookmark button
Alert button
Aug 16, 2020
Weichen Wang, Jiequn Han, Zhuoran Yang, Zhaoran Wang

Figure 1 for Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time
Figure 2 for Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time
Viaarxiv icon

Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy

Add code
Bookmark button
Alert button
Aug 02, 2020
Zuyue Fu, Zhuoran Yang, Zhaoran Wang

Figure 1 for Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Viaarxiv icon