Alert button
Picture for Andre Barreto

Andre Barreto

Alert button

Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction

Add code
Bookmark button
Alert button
Dec 05, 2018
Jonathan J Hunt, Andre Barreto, Timothy P Lillicrap, Nicolas Heess

Figure 1 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 2 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 3 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 4 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Viaarxiv icon

Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem

Add code
Bookmark button
Alert button
Jul 09, 2018
Hugo Penedones, Damien Vincent, Hartmut Maennel, Sylvain Gelly, Timothy Mann, Andre Barreto

Figure 1 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 2 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 3 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 4 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Viaarxiv icon

The Predictron: End-To-End Learning and Planning

Add code
Bookmark button
Alert button
Jul 20, 2017
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris

Figure 1 for The Predictron: End-To-End Learning and Planning
Figure 2 for The Predictron: End-To-End Learning and Planning
Figure 3 for The Predictron: End-To-End Learning and Planning
Figure 4 for The Predictron: End-To-End Learning and Planning
Viaarxiv icon