Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Shaking the foundations: delusions in sequence models for interaction and control



Pedro A. Ortega , Markus Kunesch , Grégoire Delétang , Tim Genewein , Jordi Grau-Moya , Joel Veness , Jonas Buchli , Jonas Degrave , Bilal Piot , Julien Perolat , Tom Everitt , Corentin Tallec , Emilio Parisotto , Tom Erez , Yutian Chen , Scott Reed , Marcus Hutter , Nando de Freitas , Shane Legg

* DeepMind Tech Report, 16 pages, 4 figures 

   Access Paper or Ask Questions

Local Search for Policy Iteration in Continuous Control



Jost Tobias Springenberg , Nicolas Heess , Daniel Mankowitz , Josh Merel , Arunkumar Byravan , Abbas Abdolmaleki , Jackie Kay , Jonas Degrave , Julian Schrittwieser , Yuval Tassa , Jonas Buchli , Dan Belov , Martin Riedmiller


   Access Paper or Ask Questions

Quinoa: a Q-function You Infer Normalized Over Actions



Jonas Degrave , Abbas Abdolmaleki , Jost Tobias Springenberg , Nicolas Heess , Martin Riedmiller

* Deep RL Workshop/NeurIPS 

   Access Paper or Ask Questions

Self-supervised Learning of Image Embedding for Continuous Control



Carlos Florensa , Jonas Degrave , Nicolas Heess , Jost Tobias Springenberg , Martin Riedmiller

* Contributed talk at Inference to Control workshop at NeurIPS2018 

   Access Paper or Ask Questions

Relative Entropy Regularized Policy Iteration



Abbas Abdolmaleki , Jost Tobias Springenberg , Jonas Degrave , Steven Bohez , Yuval Tassa , Dan Belov , Nicolas Heess , Martin Riedmiller


   Access Paper or Ask Questions

A Differentiable Physics Engine for Deep Learning in Robotics



Jonas Degrave , Michiel Hermans , Joni Dambre , Francis wyffels

* Submitted for International Conference on Learning Representations 2017 

   Access Paper or Ask Questions

BRUNO: A Deep Recurrent Model for Exchangeable Data



Iryna Korshunova , Jonas Degrave , Ferenc Huszár , Yarin Gal , Arthur Gretton , Joni Dambre

* NIPS 2018 

   Access Paper or Ask Questions

Oncilla robot: a versatile open-source quadruped research robot with compliant pantograph legs



Alexander Spröwitz , Alexandre Tuleu , Mostafa Ajallooeian , Massimo Vespignani , Rico Moeckel , Peter Eckert , Michiel D'Haene , Jonas Degrave , Arne Nordmann , Benjamin Schrauwen , Jochen Steil , Auke Jan Ijspeert

* Front. Robot. AI 5:67 (2018) 

   Access Paper or Ask Questions

Learning by Playing - Solving Sparse Reward Tasks from Scratch



Martin Riedmiller , Roland Hafner , Thomas Lampe , Michael Neunert , Jonas Degrave , Tom Van de Wiele , Volodymyr Mnih , Nicolas Heess , Jost Tobias Springenberg

* A video of the rich set of learned behaviours can be found at https://youtu.be/mPKyvocNe_M 

   Access Paper or Ask Questions

Dual Rectified Linear Units (DReLUs): A Replacement for Tanh Activation Functions in Quasi-Recurrent Neural Networks



Fréderic Godin , Jonas Degrave , Joni Dambre , Wesley De Neve


   Access Paper or Ask Questions

1
2
>>