Picture for Andre Barreto

Andre Barreto

Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction

Add code
Dec 05, 2018
Figure 1 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 2 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 3 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 4 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Viaarxiv icon

Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem

Add code
Jul 09, 2018
Figure 1 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 2 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 3 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 4 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Viaarxiv icon

The Predictron: End-To-End Learning and Planning

Add code
Jul 20, 2017
Figure 1 for The Predictron: End-To-End Learning and Planning
Figure 2 for The Predictron: End-To-End Learning and Planning
Figure 3 for The Predictron: End-To-End Learning and Planning
Figure 4 for The Predictron: End-To-End Learning and Planning
Viaarxiv icon