Zhuoran Yang

Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning


Nov 06, 2021
Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang


SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning


Oct 24, 2021
Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Zhaoran Wang, Jing Jiang


On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game


Oct 19, 2021
Shuang Qiu, Jieping Ye, Zhaoran Wang, Zhuoran Yang

* ICML 2021 

Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs


Oct 18, 2021
Han Zhong, Zhuoran Yang, Zhaoran Wang, Csaba Szepesvári


Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima


Oct 12, 2021
Boyi Liu, Jiayang Li, Zhuoran Yang, Hoi-To Wai, Mingyi Hong, Yu Marco Nie, Zhaoran Wang

* 31 pages; typos corrected 

Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation


Aug 19, 2021
Zhihan Liu, Yufeng Zhang, Zuyue Fu, Zhuoran Yang, Zhaoran Wang

* 54 pages, in submission 

Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning


Aug 08, 2021
Pratik Ramprasad, Yuantong Li, Zhuoran Yang, Zhaoran Wang, Will Wei Sun, Guang Cheng


Towards General Function Approximation in Zero-Sum Markov Games


Jul 30, 2021
Baihe Huang, Jason D. Lee, Zhaoran Wang, Zhuoran Yang


A Unified Off-Policy Evaluation Approach for General Value Function


Jul 06, 2021
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang

* Submitted for publication

Gap-Dependent Bounds for Two-Player Markov Games


Jul 01, 2021
Zehao Dou, Zhuoran Yang, Zhaoran Wang, Simon S. Du

* 34 pages 

Randomized Exploration for Reinforcement Learning with General Value Function Approximation


Jun 15, 2021
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin F. Yang

* 32 pages, 5 figures, in Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality


Feb 27, 2021
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang

* Submitted for publication 

Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning


Feb 19, 2021
Luofeng Liao, Zuyue Fu, Zhuoran Yang, Mladen Kolar, Zhaoran Wang

* under review 

A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization


Feb 15, 2021
Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang

* 34 Pages, 10 Figures 

Is Pessimism Provably Efficient for Offline RL?


Dec 30, 2020
Ying Jin, Zhuoran Yang, Zhaoran Wang

* 53 pages, 3 figures 

Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy


Dec 28, 2020
Han Zhong, Ethan X. Fang, Zhuoran Yang, Zhaoran Wang

* 45 pages 

Variational Transport: A Convergent Particle-Based Algorithm for Distributional Optimization


Dec 21, 2020
Zhuoran Yang, Yufeng Zhang, Yongxin Chen, Zhaoran Wang

* 58 pages 

Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations


Nov 09, 2020
Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael I. Jordan

* 76 pages. The short version of this work appears in NeurIPS 2020 

Provable Fictitious Play for General Mean-Field Games


Oct 08, 2020
Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, Andreea Minca


Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning


Aug 23, 2020
Shuang Qiu, Zhuoran Yang, Xiaohan Wei, Jieping Ye, Zhaoran Wang

* 45 pages; initial draft submitted in Feb, 2020 

Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time


Aug 16, 2020
Weichen Wang, Jiequn Han, Zhuoran Yang, Zhaoran Wang

* 28 pages, 3 figures 

Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy


Aug 02, 2020
Zuyue Fu, Zhuoran Yang, Zhaoran Wang


Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model


Jul 16, 2020
Jianqing Fan, Zhuoran Yang, Mengxin Yu


A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic


Jul 10, 2020
Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang


Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach


Jul 02, 2020
Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Zhaoran Wang, Mladen Kolar

* Submitted to NeurIPS 2020. Under review 

Dynamic Regret of Policy Optimization in Non-stationary Environments


Jun 30, 2020
Yingjie Fei, Zhuoran Yang, Zhaoran Wang, Qiaomin Xie


On the Global Optimality of Model-Agnostic Meta-Learning


Jun 23, 2020
Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

* 41 pages; accepted to ICML; initial draft submitted in Feb, 2020 

Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret


Jun 22, 2020
Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, Qiaomin Xie


Provably Efficient Causal Reinforcement Learning with Confounded Observational Data


Jun 22, 2020
Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

* 42 pages, 4 figures 
