Is Pessimism Provably Efficient for Offline RL?

Dec 30, 2020
Ying Jin, Zhuoran Yang, Zhaoran Wang

* 53 pages, 3 figures 

Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy

Dec 28, 2020
Han Zhong, Ethan X. Fang, Zhuoran Yang, Zhaoran Wang

* 45 pages 

Variational Transport: A Convergent Particle-Based Algorithm for Distributional Optimization

Dec 21, 2020
Zhuoran Yang, Yufeng Zhang, Yongxin Chen, Zhaoran Wang

* 58 pages 

Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations

Nov 09, 2020
Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael I. Jordan

* 76 pages. The short version of this work appears in NeurIPS 2020 

Provable Fictitious Play for General Mean-Field Games

Oct 08, 2020
Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, Andreea Minca


Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning

Aug 23, 2020
Shuang Qiu, Zhuoran Yang, Xiaohan Wei, Jieping Ye, Zhaoran Wang

* 45 pages; initial draft submitted in Feb, 2020 

Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time

Aug 16, 2020
Weichen Wang, Jiequn Han, Zhuoran Yang, Zhaoran Wang

* 28 pages, 3 figures 

Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy

Aug 02, 2020
Zuyue Fu, Zhuoran Yang, Zhaoran Wang


Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model

Jul 16, 2020
Jianqing Fan, Zhuoran Yang, Mengxin Yu


A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic

Jul 10, 2020
Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang


Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach

Jul 02, 2020
Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Zhaoran Wang, Mladen Kolar

* Submitted to NeurIPS 2020. Under review 

Dynamic Regret of Policy Optimization in Non-stationary Environments

Jun 30, 2020
Yingjie Fei, Zhuoran Yang, Zhaoran Wang, Qiaomin Xie


On the Global Optimality of Model-Agnostic Meta-Learning

Jun 23, 2020
Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

* 41 pages; accepted to ICML; initial draft submitted in Feb, 2020 

Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret

Jun 22, 2020
Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, Qiaomin Xie


Provably Efficient Causal Reinforcement Learning with Confounded Observational Data

Jun 22, 2020
Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

* 42 pages, 4 figures 

Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning

Jun 21, 2020
Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

* 31 pages; accepted to ICML; initial draft submitted in Feb, 2020 

Neural Certificates for Safe Control Policies

Jun 15, 2020
Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou


Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory

Jun 08, 2020
Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang


Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Mar 21, 2020
Qiaomin Xie, Yudong Chen, Zhaoran Wang, Zhuoran Yang


Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate

Mar 08, 2020
Yufeng Zhang, Qi Cai, Zhuoran Yang, Zhaoran Wang


Semiparametric Nonlinear Bipartite Graph Representation Learning with Provable Guarantees

Mar 02, 2020
Sen Na, Yuwei Luo, Zhuoran Yang, Zhaoran Wang, Mladen Kolar


Upper Confidence Primal-Dual Optimization: Stochastically Constrained Markov Decision Processes with Adversarial Losses and Unknown Transitions

Mar 02, 2020
Shuang Qiu, Xiaohan Wei, Zhuoran Yang, Jieping Ye, Zhaoran Wang


Provably Efficient Safe Exploration via Primal-Dual Policy Optimization

Mar 01, 2020
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

* 41 pages 

Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

Feb 10, 2020
Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

* corrected typos & references 

On Computation and Generalization of Generative Adversarial Imitation Learning

Jan 12, 2020
Minshuo Chen, Yizhou Wang, Tianyi Liu, Zhuoran Yang, Xingguo Li, Zhaoran Wang, Tuo Zhao


Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator

Dec 14, 2019
Yuwei Luo, Zhuoran Yang, Zhaoran Wang, Mladen Kolar


Provably Efficient Exploration in Policy Optimization

Dec 12, 2019
Qi Cai, Zhuoran Yang, Chi Jin, Zhaoran Wang

