Zhuoran Yang

Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality


Feb 27, 2021
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang

* Submitted for publication 


Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning


Feb 19, 2021
Luofeng Liao, Zuyue Fu, Zhuoran Yang, Mladen Kolar, Zhaoran Wang

* under review 


A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization


Feb 15, 2021
Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang

* 34 Pages, 10 Figures 


Is Pessimism Provably Efficient for Offline RL?


Dec 30, 2020
Ying Jin, Zhuoran Yang, Zhaoran Wang

* 53 pages, 3 figures 


Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy


Dec 28, 2020
Han Zhong, Ethan X. Fang, Zhuoran Yang, Zhaoran Wang

* 45 pages 


Variational Transport: A Convergent Particle-Based Algorithm for Distributional Optimization


Dec 21, 2020
Zhuoran Yang, Yufeng Zhang, Yongxin Chen, Zhaoran Wang

* 58 pages 


Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations


Nov 09, 2020
Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael I. Jordan

* 76 pages. The short version of this work appears in NeurIPS 2020 


Provable Fictitious Play for General Mean-Field Games


Oct 08, 2020
Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, Andreea Minca



Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning


Aug 23, 2020
Shuang Qiu, Zhuoran Yang, Xiaohan Wei, Jieping Ye, Zhaoran Wang

* 45 pages; initial draft submitted in Feb, 2020 


Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time


Aug 16, 2020
Weichen Wang, Jiequn Han, Zhuoran Yang, Zhaoran Wang

* 28 pages, 3 figures 


Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy


Aug 02, 2020
Zuyue Fu, Zhuoran Yang, Zhaoran Wang



Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model


Jul 16, 2020
Jianqing Fan, Zhuoran Yang, Mengxin Yu



A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic


Jul 10, 2020
Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang



Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach


Jul 02, 2020
Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Zhaoran Wang, Mladen Kolar

* Submitted to NeurIPS 2020. Under review 


Dynamic Regret of Policy Optimization in Non-stationary Environments


Jun 30, 2020
Yingjie Fei, Zhuoran Yang, Zhaoran Wang, Qiaomin Xie



On the Global Optimality of Model-Agnostic Meta-Learning


Jun 23, 2020
Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

* 41 pages; accepted to ICML; initial draft submitted in Feb, 2020 


Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret


Jun 22, 2020
Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, Qiaomin Xie



Provably Efficient Causal Reinforcement Learning with Confounded Observational Data


Jun 22, 2020
Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

* 42 pages, 4 figures 


Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning


Jun 21, 2020
Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

* 31 pages; accepted to ICML; initial draft submitted in Feb, 2020 


Neural Certificates for Safe Control Policies


Jun 15, 2020
Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou



Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory


Jun 08, 2020
Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang



Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium


Mar 21, 2020
Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang



Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate


Mar 08, 2020
Yufeng Zhang, Qi Cai, Zhuoran Yang, Zhaoran Wang



Semiparametric Nonlinear Bipartite Graph Representation Learning with Provable Guarantees


Mar 02, 2020
Sen Na, Yuwei Luo, Zhuoran Yang, Zhaoran Wang, Mladen Kolar



Upper Confidence Primal-Dual Optimization: Stochastically Constrained Markov Decision Processes with Adversarial Losses and Unknown Transitions


Mar 02, 2020
Shuang Qiu, Xiaohan Wei, Zhuoran Yang, Jieping Ye, Zhaoran Wang



Provably Efficient Safe Exploration via Primal-Dual Policy Optimization


Mar 01, 2020
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

* 41 pages 


Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework


Feb 10, 2020
Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

* corrected typos & references 


On Computation and Generalization of Generative Adversarial Imitation Learning


Jan 12, 2020
Minshuo Chen, Yizhou Wang, Tianyi Liu, Zhuoran Yang, Xingguo Li, Zhaoran Wang, Tuo Zhao

