Picture for Nando de Freitas

Nando de Freitas

University of British Columbia

Learning to Perform Physics Experiments via Deep Reinforcement Learning

Add code
Aug 17, 2017
Figure 1 for Learning to Perform Physics Experiments via Deep Reinforcement Learning
Figure 2 for Learning to Perform Physics Experiments via Deep Reinforcement Learning
Figure 3 for Learning to Perform Physics Experiments via Deep Reinforcement Learning
Viaarxiv icon

Robust Imitation of Diverse Behaviors

Add code
Jul 14, 2017
Figure 1 for Robust Imitation of Diverse Behaviors
Figure 2 for Robust Imitation of Diverse Behaviors
Figure 3 for Robust Imitation of Diverse Behaviors
Figure 4 for Robust Imitation of Diverse Behaviors
Viaarxiv icon

The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously

Add code
Jul 11, 2017
Figure 1 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 2 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 3 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 4 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Viaarxiv icon

Sample Efficient Actor-Critic with Experience Replay

Add code
Jul 10, 2017
Figure 1 for Sample Efficient Actor-Critic with Experience Replay
Figure 2 for Sample Efficient Actor-Critic with Experience Replay
Figure 3 for Sample Efficient Actor-Critic with Experience Replay
Figure 4 for Sample Efficient Actor-Critic with Experience Replay
Viaarxiv icon

Programmable Agents

Add code
Jun 20, 2017
Figure 1 for Programmable Agents
Figure 2 for Programmable Agents
Figure 3 for Programmable Agents
Figure 4 for Programmable Agents
Viaarxiv icon

Learning to Learn without Gradient Descent by Gradient Descent

Add code
Jun 12, 2017
Figure 1 for Learning to Learn without Gradient Descent by Gradient Descent
Figure 2 for Learning to Learn without Gradient Descent by Gradient Descent
Figure 3 for Learning to Learn without Gradient Descent by Gradient Descent
Figure 4 for Learning to Learn without Gradient Descent by Gradient Descent
Viaarxiv icon

Parallel Multiscale Autoregressive Density Estimation

Add code
Mar 10, 2017
Figure 1 for Parallel Multiscale Autoregressive Density Estimation
Figure 2 for Parallel Multiscale Autoregressive Density Estimation
Figure 3 for Parallel Multiscale Autoregressive Density Estimation
Figure 4 for Parallel Multiscale Autoregressive Density Estimation
Viaarxiv icon

LipNet: End-to-End Sentence-level Lipreading

Add code
Dec 16, 2016
Figure 1 for LipNet: End-to-End Sentence-level Lipreading
Figure 2 for LipNet: End-to-End Sentence-level Lipreading
Figure 3 for LipNet: End-to-End Sentence-level Lipreading
Figure 4 for LipNet: End-to-End Sentence-level Lipreading
Viaarxiv icon

Learning to learn by gradient descent by gradient descent

Add code
Nov 30, 2016
Figure 1 for Learning to learn by gradient descent by gradient descent
Figure 2 for Learning to learn by gradient descent by gradient descent
Figure 3 for Learning to learn by gradient descent by gradient descent
Figure 4 for Learning to learn by gradient descent by gradient descent
Viaarxiv icon

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

Add code
May 24, 2016
Figure 1 for Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Figure 2 for Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Figure 3 for Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Figure 4 for Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Viaarxiv icon