Value-driven Hindsight Modelling

Feb 19, 2020
Arthur Guez, Fabio Viola, Théophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

* 8 pages + reference + appendix 

  Access Model/Code and Paper
An investigation of model-free planning

Jan 11, 2019
Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Théophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap


  Access Model/Code and Paper
Credit Assignment Techniques in Stochastic Computation Graphs

Jan 07, 2019
Théophane Weber, Nicolas Heess, Lars Buesing, David Silver


  Access Model/Code and Paper
Single-Agent Policy Tree Search With Guarantees

Nov 28, 2018
Laurent Orseau, Levi H. S. Lelis, Tor Lattimore, Théophane Weber

* 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montr\'eal, Canada 

  Access Model/Code and Paper
Learning to Search with MCTSnets

Jul 17, 2018
Arthur Guez, Théophane Weber, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals, Daan Wierstra, Rémi Munos, David Silver

* ICML 2018 (camera-ready version) 

  Access Model/Code and Paper
Imagination-Augmented Agents for Deep Reinforcement Learning

Feb 14, 2018
Théophane Weber, Sébastien Racanière, David P. Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adria Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter Battaglia, Demis Hassabis, David Silver, Daan Wierstra


  Access Model/Code and Paper
Learning model-based planning from scratch

Jul 19, 2017
Razvan Pascanu, Yujia Li, Oriol Vinyals, Nicolas Heess, Lars Buesing, Sebastien Racanière, David Reichert, Théophane Weber, Daan Wierstra, Peter Battaglia


  Access Model/Code and Paper