Alert button

Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods

Sep 13, 2021
Xin Guo, Anran Hu, Junzi Zhang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: