Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Andre Barreto

Discovering Diverse Nearly Optimal Policies withSuccessor Features


Jun 01, 2021
Tom Zahavy, Brendan O'Donoghue, Andre Barreto, Volodymyr Mnih, Sebastian Flennerhag, Satinder Singh


  Access Paper or Ask Questions

Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning


Feb 24, 2021
Víctor Campos, Pablo Sprechmann, Steven Hansen, Andre Barreto, Steven Kapturowski, Alex Vitvitskyi, Adrià Puigdomènech Badia, Charles Blundell


  Access Paper or Ask Questions

Discovering a set of policies for the worst case reward


Feb 08, 2021
Tom Zahavy, Andre Barreto, Daniel J Mankowitz, Shaobo Hou, Brendan O'Donoghue, Iurii Kemaev, Satinder Baveja Singh


  Access Paper or Ask Questions

Temporal Difference Uncertainties as a Signal for Exploration


Oct 05, 2020
Sebastian Flennerhag, Jane X. Wang, Pablo Sprechmann, Francesco Visin, Alexandre Galashov, Steven Kapturowski, Diana L. Borsa, Nicolas Heess, Andre Barreto, Razvan Pascanu

* 8 pages, 11 figures, 5 tables 

  Access Paper or Ask Questions

Disentangled Cumulants Help Successor Representations Transfer to New Tasks


Nov 25, 2019
Christopher Grimm, Irina Higgins, Andre Barreto, Denis Teplyashin, Markus Wulfmeier, Tim Hertweck, Raia Hadsell, Satinder Singh


  Access Paper or Ask Questions

General non-linear Bellman equations


Jul 08, 2019
Hado van Hasselt, John Quan, Matteo Hessel, Zhongwen Xu, Diana Borsa, Andre Barreto


  Access Paper or Ask Questions

Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates


Jun 19, 2019
Hugo Penedones, Carlos Riquelme, Damien Vincent, Hartmut Maennel, Timothy Mann, Andre Barreto, Sylvain Gelly, Gergely Neu


  Access Paper or Ask Questions

Fast Task Inference with Variational Intrinsic Successor Features


Jun 12, 2019
Steven Hansen, Will Dabney, Andre Barreto, Tom Van de Wiele, David Warde-Farley, Volodymyr Mnih


  Access Paper or Ask Questions

Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction


Dec 05, 2018
Jonathan J Hunt, Andre Barreto, Timothy P Lillicrap, Nicolas Heess


  Access Paper or Ask Questions

Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem


Jul 09, 2018
Hugo Penedones, Damien Vincent, Hartmut Maennel, Sylvain Gelly, Timothy Mann, Andre Barreto


  Access Paper or Ask Questions

The Predictron: End-To-End Learning and Planning


Jul 20, 2017
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris

* Camera-ready version, ICML 2017, with supplement 

  Access Paper or Ask Questions