Alert button
Picture for Zhuoran Yang

Zhuoran Yang

Alert button

Neural Policy Gradient Methods: Global Optimality and Rates of Convergence

Add code
Bookmark button
Alert button
Aug 29, 2019
Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

Viaarxiv icon

Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization

Add code
Bookmark button
Alert button
Aug 24, 2019
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

Figure 1 for Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Figure 2 for Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Figure 3 for Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Figure 4 for Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Viaarxiv icon

Robust One-Bit Recovery via ReLU Generative Networks: Improved Statistical Rates and Global Landscape Analysis

Add code
Bookmark button
Alert button
Aug 14, 2019
Shuang Qiu, Xiaohan Wei, Zhuoran Yang

Figure 1 for Robust One-Bit Recovery via ReLU Generative Networks: Improved Statistical Rates and Global Landscape Analysis
Viaarxiv icon

Provably Efficient Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Aug 08, 2019
Chi Jin, Zhuoran Yang, Zhaoran Wang, Michael I. Jordan

Viaarxiv icon

Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization

Add code
Bookmark button
Alert button
Aug 07, 2019
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

Figure 1 for Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
Figure 2 for Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
Figure 3 for Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
Figure 4 for Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization
Viaarxiv icon

More Supervision, Less Computation: Statistical-Computational Tradeoffs in Weakly Supervised Learning

Add code
Bookmark button
Alert button
Jul 14, 2019
Xinyang Yi, Zhaoran Wang, Zhuoran Yang, Constantine Caramanis, Han Liu

Figure 1 for More Supervision, Less Computation: Statistical-Computational Tradeoffs in Weakly Supervised Learning
Viaarxiv icon

On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost

Add code
Bookmark button
Alert button
Jul 14, 2019
Zhuoran Yang, Yongxin Chen, Mingyi Hong, Zhaoran Wang

Viaarxiv icon

Stochastic Convergence Results for Regularized Actor-Critic Methods

Add code
Bookmark button
Alert button
Jul 13, 2019
Wesley Suttle, Zhuoran Yang, Kaiqing Zhang, Ji Liu

Figure 1 for Stochastic Convergence Results for Regularized Actor-Critic Methods
Figure 2 for Stochastic Convergence Results for Regularized Actor-Critic Methods
Viaarxiv icon

A Communication-Efficient Multi-Agent Actor-Critic Algorithm for Distributed Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 06, 2019
Yixuan Lin, Kaiqing Zhang, Zhuoran Yang, Zhaoran Wang, Tamer Başar, Romeil Sandhu, Ji Liu

Viaarxiv icon