Alert button
Picture for Nicolas Heess

Nicolas Heess

Alert button

Relative Entropy Regularized Policy Iteration

Add code
Bookmark button
Alert button
Dec 05, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, Nicolas Heess, Martin Riedmiller

Figure 1 for Relative Entropy Regularized Policy Iteration
Figure 2 for Relative Entropy Regularized Policy Iteration
Figure 3 for Relative Entropy Regularized Policy Iteration
Figure 4 for Relative Entropy Regularized Policy Iteration
Viaarxiv icon

Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction

Add code
Bookmark button
Alert button
Dec 05, 2018
Jonathan J Hunt, Andre Barreto, Timothy P Lillicrap, Nicolas Heess

Figure 1 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 2 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 3 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Figure 4 for Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction
Viaarxiv icon

Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures

Add code
Bookmark button
Alert button
Dec 04, 2018
Jonathan Uesato, Ananya Kumar, Csaba Szepesvari, Tom Erez, Avraham Ruderman, Keith Anderson, Krishmamurthy, Dvijotham, Nicolas Heess, Pushmeet Kohli

Figure 1 for Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Figure 2 for Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Figure 3 for Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Figure 4 for Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Viaarxiv icon

Neural probabilistic motor primitives for humanoid control

Add code
Bookmark button
Alert button
Nov 28, 2018
Josh Merel, Leonard Hasenclever, Alexandre Galashov, Arun Ahuja, Vu Pham, Greg Wayne, Yee Whye Teh, Nicolas Heess

Figure 1 for Neural probabilistic motor primitives for humanoid control
Figure 2 for Neural probabilistic motor primitives for humanoid control
Figure 3 for Neural probabilistic motor primitives for humanoid control
Figure 4 for Neural probabilistic motor primitives for humanoid control
Viaarxiv icon

Hierarchical visuomotor control of humanoids

Add code
Bookmark button
Alert button
Nov 23, 2018
Josh Merel, Arun Ahuja, Vu Pham, Saran Tunyasuvunakool, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Greg Wayne

Figure 1 for Hierarchical visuomotor control of humanoids
Figure 2 for Hierarchical visuomotor control of humanoids
Figure 3 for Hierarchical visuomotor control of humanoids
Figure 4 for Hierarchical visuomotor control of humanoids
Viaarxiv icon

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

Add code
Bookmark button
Alert button
Nov 15, 2018
Lars Buesing, Theophane Weber, Yori Zwols, Sebastien Racaniere, Arthur Guez, Jean-Baptiste Lespiau, Nicolas Heess

Figure 1 for Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Figure 2 for Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Figure 3 for Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Figure 4 for Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Viaarxiv icon

Relational inductive biases, deep learning, and graph networks

Add code
Bookmark button
Alert button
Oct 17, 2018
Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, Yujia Li, Razvan Pascanu

Figure 1 for Relational inductive biases, deep learning, and graph networks
Figure 2 for Relational inductive biases, deep learning, and graph networks
Figure 3 for Relational inductive biases, deep learning, and graph networks
Figure 4 for Relational inductive biases, deep learning, and graph networks
Viaarxiv icon

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

Add code
Bookmark button
Alert button
Oct 08, 2018
Mel Vecerik, Todd Hester, Jonathan Scholz, Fumin Wang, Olivier Pietquin, Bilal Piot, Nicolas Heess, Thomas Rothörl, Thomas Lampe, Martin Riedmiller

Figure 1 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 2 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 3 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 4 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Viaarxiv icon

Unsupervised Learning of 3D Structure from Images

Add code
Bookmark button
Alert button
Jun 19, 2018
Danilo Jimenez Rezende, S. M. Ali Eslami, Shakir Mohamed, Peter Battaglia, Max Jaderberg, Nicolas Heess

Figure 1 for Unsupervised Learning of 3D Structure from Images
Figure 2 for Unsupervised Learning of 3D Structure from Images
Figure 3 for Unsupervised Learning of 3D Structure from Images
Figure 4 for Unsupervised Learning of 3D Structure from Images
Viaarxiv icon

Maximum a Posteriori Policy Optimisation

Add code
Bookmark button
Alert button
Jun 14, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Yuval Tassa, Remi Munos, Nicolas Heess, Martin Riedmiller

Figure 1 for Maximum a Posteriori Policy Optimisation
Figure 2 for Maximum a Posteriori Policy Optimisation
Figure 3 for Maximum a Posteriori Policy Optimisation
Figure 4 for Maximum a Posteriori Policy Optimisation
Viaarxiv icon