Alert button
Picture for Misha Denil

Misha Denil

Alert button

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

Add code
Bookmark button
Alert button
Sep 03, 2019
Tom Le Paine, Caglar Gulcehre, Bobak Shahriari, Misha Denil, Matt Hoffman, Hubert Soyer, Richard Tanburn, Steven Kapturowski, Neil Rabinowitz, Duncan Williams, Gabriel Barth-Maron, Ziyu Wang, Nando de Freitas, Worlds Team

Figure 1 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 2 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 3 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 4 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Viaarxiv icon

Hyperbolic Attention Networks

Add code
Bookmark button
Alert button
May 24, 2018
Caglar Gulcehre, Misha Denil, Mateusz Malinowski, Ali Razavi, Razvan Pascanu, Karl Moritz Hermann, Peter Battaglia, Victor Bapst, David Raposo, Adam Santoro, Nando de Freitas

Figure 1 for Hyperbolic Attention Networks
Figure 2 for Hyperbolic Attention Networks
Figure 3 for Hyperbolic Attention Networks
Figure 4 for Hyperbolic Attention Networks
Viaarxiv icon

Learning Awareness Models

Add code
Bookmark button
Alert button
Apr 17, 2018
Brandon Amos, Laurent Dinh, Serkan Cabi, Thomas Rothörl, Sergio Gómez Colmenarejo, Alistair Muldal, Tom Erez, Yuval Tassa, Nando de Freitas, Misha Denil

Figure 1 for Learning Awareness Models
Figure 2 for Learning Awareness Models
Figure 3 for Learning Awareness Models
Figure 4 for Learning Awareness Models
Viaarxiv icon

Learned Optimizers that Scale and Generalize

Add code
Bookmark button
Alert button
Sep 07, 2017
Olga Wichrowska, Niru Maheswaranathan, Matthew W. Hoffman, Sergio Gomez Colmenarejo, Misha Denil, Nando de Freitas, Jascha Sohl-Dickstein

Figure 1 for Learned Optimizers that Scale and Generalize
Figure 2 for Learned Optimizers that Scale and Generalize
Figure 3 for Learned Optimizers that Scale and Generalize
Figure 4 for Learned Optimizers that Scale and Generalize
Viaarxiv icon

Learning to Perform Physics Experiments via Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 17, 2017
Misha Denil, Pulkit Agrawal, Tejas D Kulkarni, Tom Erez, Peter Battaglia, Nando de Freitas

Figure 1 for Learning to Perform Physics Experiments via Deep Reinforcement Learning
Figure 2 for Learning to Perform Physics Experiments via Deep Reinforcement Learning
Figure 3 for Learning to Perform Physics Experiments via Deep Reinforcement Learning
Viaarxiv icon

The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously

Add code
Bookmark button
Alert button
Jul 11, 2017
Serkan Cabi, Sergio Gómez Colmenarejo, Matthew W. Hoffman, Misha Denil, Ziyu Wang, Nando de Freitas

Figure 1 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 2 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 3 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 4 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Viaarxiv icon

Programmable Agents

Add code
Bookmark button
Alert button
Jun 20, 2017
Misha Denil, Sergio Gómez Colmenarejo, Serkan Cabi, David Saxton, Nando de Freitas

Figure 1 for Programmable Agents
Figure 2 for Programmable Agents
Figure 3 for Programmable Agents
Figure 4 for Programmable Agents
Viaarxiv icon

Learning to Learn without Gradient Descent by Gradient Descent

Add code
Bookmark button
Alert button
Jun 12, 2017
Yutian Chen, Matthew W. Hoffman, Sergio Gomez Colmenarejo, Misha Denil, Timothy P. Lillicrap, Matt Botvinick, Nando de Freitas

Figure 1 for Learning to Learn without Gradient Descent by Gradient Descent
Figure 2 for Learning to Learn without Gradient Descent by Gradient Descent
Figure 3 for Learning to Learn without Gradient Descent by Gradient Descent
Figure 4 for Learning to Learn without Gradient Descent by Gradient Descent
Viaarxiv icon

Learning to Navigate in Complex Environments

Add code
Bookmark button
Alert button
Jan 13, 2017
Piotr Mirowski, Razvan Pascanu, Fabio Viola, Hubert Soyer, Andrew J. Ballard, Andrea Banino, Misha Denil, Ross Goroshin, Laurent Sifre, Koray Kavukcuoglu, Dharshan Kumaran, Raia Hadsell

Figure 1 for Learning to Navigate in Complex Environments
Figure 2 for Learning to Navigate in Complex Environments
Figure 3 for Learning to Navigate in Complex Environments
Figure 4 for Learning to Navigate in Complex Environments
Viaarxiv icon

Learning to learn by gradient descent by gradient descent

Add code
Bookmark button
Alert button
Nov 30, 2016
Marcin Andrychowicz, Misha Denil, Sergio Gomez, Matthew W. Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, Nando de Freitas

Figure 1 for Learning to learn by gradient descent by gradient descent
Figure 2 for Learning to learn by gradient descent by gradient descent
Figure 3 for Learning to learn by gradient descent by gradient descent
Figure 4 for Learning to learn by gradient descent by gradient descent
Viaarxiv icon