Picture for David Silver

David Silver

University College London

Asynchronous Methods for Deep Reinforcement Learning

Add code
Jun 16, 2016
Figure 1 for Asynchronous Methods for Deep Reinforcement Learning
Figure 2 for Asynchronous Methods for Deep Reinforcement Learning
Figure 3 for Asynchronous Methods for Deep Reinforcement Learning
Figure 4 for Asynchronous Methods for Deep Reinforcement Learning
Viaarxiv icon

Continuous control with deep reinforcement learning

Add code
Feb 29, 2016
Figure 1 for Continuous control with deep reinforcement learning
Figure 2 for Continuous control with deep reinforcement learning
Figure 3 for Continuous control with deep reinforcement learning
Figure 4 for Continuous control with deep reinforcement learning
Viaarxiv icon

Prioritized Experience Replay

Add code
Feb 25, 2016
Figure 1 for Prioritized Experience Replay
Figure 2 for Prioritized Experience Replay
Figure 3 for Prioritized Experience Replay
Figure 4 for Prioritized Experience Replay
Viaarxiv icon

Memory-based control with recurrent neural networks

Add code
Dec 14, 2015
Figure 1 for Memory-based control with recurrent neural networks
Figure 2 for Memory-based control with recurrent neural networks
Figure 3 for Memory-based control with recurrent neural networks
Viaarxiv icon

Deep Reinforcement Learning with Double Q-learning

Add code
Dec 08, 2015
Figure 1 for Deep Reinforcement Learning with Double Q-learning
Figure 2 for Deep Reinforcement Learning with Double Q-learning
Figure 3 for Deep Reinforcement Learning with Double Q-learning
Figure 4 for Deep Reinforcement Learning with Double Q-learning
Viaarxiv icon

Learning Continuous Control Policies by Stochastic Value Gradients

Add code
Oct 30, 2015
Figure 1 for Learning Continuous Control Policies by Stochastic Value Gradients
Figure 2 for Learning Continuous Control Policies by Stochastic Value Gradients
Figure 3 for Learning Continuous Control Policies by Stochastic Value Gradients
Figure 4 for Learning Continuous Control Policies by Stochastic Value Gradients
Viaarxiv icon

Massively Parallel Methods for Deep Reinforcement Learning

Add code
Jul 16, 2015
Figure 1 for Massively Parallel Methods for Deep Reinforcement Learning
Figure 2 for Massively Parallel Methods for Deep Reinforcement Learning
Figure 3 for Massively Parallel Methods for Deep Reinforcement Learning
Figure 4 for Massively Parallel Methods for Deep Reinforcement Learning
Viaarxiv icon

Move Evaluation in Go Using Deep Convolutional Neural Networks

Add code
Apr 10, 2015
Figure 1 for Move Evaluation in Go Using Deep Convolutional Neural Networks
Figure 2 for Move Evaluation in Go Using Deep Convolutional Neural Networks
Figure 3 for Move Evaluation in Go Using Deep Convolutional Neural Networks
Viaarxiv icon

Value Iteration with Options and State Aggregation

Add code
Jan 16, 2015
Figure 1 for Value Iteration with Options and State Aggregation
Figure 2 for Value Iteration with Options and State Aggregation
Figure 3 for Value Iteration with Options and State Aggregation
Viaarxiv icon

Unit Tests for Stochastic Optimization

Add code
Feb 25, 2014
Figure 1 for Unit Tests for Stochastic Optimization
Figure 2 for Unit Tests for Stochastic Optimization
Figure 3 for Unit Tests for Stochastic Optimization
Figure 4 for Unit Tests for Stochastic Optimization
Viaarxiv icon