Alert button
Picture for Zhaoran Wang

Zhaoran Wang

Alert button

Neural Policy Gradient Methods: Global Optimality and Rates of Convergence

Add code
Bookmark button
Alert button
Oct 07, 2019
Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

Viaarxiv icon

Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization

Add code
Bookmark button
Alert button
Aug 24, 2019
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

Figure 1 for Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Figure 2 for Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Figure 3 for Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Figure 4 for Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Viaarxiv icon

Provably Efficient Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Aug 08, 2019
Chi Jin, Zhuoran Yang, Zhaoran Wang, Michael I. Jordan

Viaarxiv icon

More Supervision, Less Computation: Statistical-Computational Tradeoffs in Weakly Supervised Learning

Add code
Bookmark button
Alert button
Jul 14, 2019
Xinyang Yi, Zhaoran Wang, Zhuoran Yang, Constantine Caramanis, Han Liu

Figure 1 for More Supervision, Less Computation: Statistical-Computational Tradeoffs in Weakly Supervised Learning
Viaarxiv icon

On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost

Add code
Bookmark button
Alert button
Jul 14, 2019
Zhuoran Yang, Yongxin Chen, Mingyi Hong, Zhaoran Wang

Viaarxiv icon

A Communication-Efficient Multi-Agent Actor-Critic Algorithm for Distributed Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 06, 2019
Yixuan Lin, Kaiqing Zhang, Zhuoran Yang, Zhaoran Wang, Tamer Başar, Romeil Sandhu, Ji Liu

Viaarxiv icon

Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy

Add code
Bookmark button
Alert button
Jun 25, 2019
Boyi Liu, Qi Cai, Zhuoran Yang, Zhaoran Wang

Viaarxiv icon

Neural Temporal-Difference Learning Converges to Global Optima

Add code
Bookmark button
Alert button
May 24, 2019
Qi Cai, Zhuoran Yang, Jason D. Lee, Zhaoran Wang

Viaarxiv icon

A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 18, 2019
Wesley Suttle, Zhuoran Yang, Kaiqing Zhang, Zhaoran Wang, Tamer Basar, Ji Liu

Viaarxiv icon

On the Global Convergence of Imitation Learning: A Case for Linear Quadratic Regulator

Add code
Bookmark button
Alert button
Jan 11, 2019
Qi Cai, Mingyi Hong, Yongxin Chen, Zhaoran Wang

Viaarxiv icon