Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators

Jun 24, 2021
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam

View paper on arXiv
