Alert button
Picture for Saad Biaz

Saad Biaz

Alert button

Stable and Efficient Policy Evaluation

Add code
Bookmark button
Alert button
Jun 06, 2020
Daoming Lyu, Bo Liu, Matthieu Geist, Wen Dong, Saad Biaz, Qi Wang

Figure 1 for Stable and Efficient Policy Evaluation
Figure 2 for Stable and Efficient Policy Evaluation
Figure 3 for Stable and Efficient Policy Evaluation
Figure 4 for Stable and Efficient Policy Evaluation
Viaarxiv icon

O$^2$TD: (Near)-Optimal Off-Policy TD Learning

Add code
Bookmark button
Alert button
Apr 19, 2017
Bo Liu, Daoming Lyu, Wen Dong, Saad Biaz

Figure 1 for O$^2$TD: (Near)-Optimal Off-Policy TD Learning
Figure 2 for O$^2$TD: (Near)-Optimal Off-Policy TD Learning
Figure 3 for O$^2$TD: (Near)-Optimal Off-Policy TD Learning
Figure 4 for O$^2$TD: (Near)-Optimal Off-Policy TD Learning
Viaarxiv icon