Alert button

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning

Feb 11, 2024
Kaiwen Wang, Owen Oertell, Alekh Agarwal, Nathan Kallus, Wen Sun

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: