Alert button

Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples

Sep 26, 2019
Tengyu Xu, Shaofeng Zou, Yingbin Liang

Figure 1 for Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples
Figure 2 for Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples
Figure 3 for Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: