V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Sep 26, 2019
H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin Riedmiller, Matthew M. Botvinick

* * equal contribution 

  Access Model/Code and Paper
TF-Replicator: Distributed Machine Learning for Researchers

Feb 01, 2019
Peter Buchlovsky, David Budden, Dominik Grewe, Chris Jones, John Aslanides, Frederic Besse, Andy Brock, Aidan Clark, Sergio G贸mez Colmenarejo, Aedan Pope, Fabio Viola, Dan Belov


  Access Model/Code and Paper
Relative Entropy Regularized Policy Iteration

Dec 05, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, Nicolas Heess, Martin Riedmiller


  Access Model/Code and Paper
Parallel WaveNet: Fast High-Fidelity Speech Synthesis

Nov 28, 2017
Aaron van den Oord, Yazhe Li, Igor Babuschkin, Karen Simonyan, Oriol Vinyals, Koray Kavukcuoglu, George van den Driessche, Edward Lockhart, Luis C. Cobo, Florian Stimberg, Norman Casagrande, Dominik Grewe, Seb Noury, Sander Dieleman, Erich Elsen, Nal Kalchbrenner, Heiga Zen, Alex Graves, Helen King, Tom Walters, Dan Belov, Demis Hassabis


  Access Model/Code and Paper
Parallel Multiscale Autoregressive Density Estimation

Mar 10, 2017
Scott Reed, A盲ron van den Oord, Nal Kalchbrenner, Sergio G贸mez Colmenarejo, Ziyu Wang, Dan Belov, Nando de Freitas


  Access Model/Code and Paper