Neural Policy Gradient Methods: Global Optimality and Rates of Convergence

Aug 29, 2019
Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang
