Alert button
Picture for Simon Schmitt

Simon Schmitt

Alert button

Exploration via Epistemic Value Estimation

Mar 07, 2023
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt

Figure 1 for Exploration via Epistemic Value Estimation
Figure 2 for Exploration via Epistemic Value Estimation
Figure 3 for Exploration via Epistemic Value Estimation
Figure 4 for Exploration via Epistemic Value Estimation
Viaarxiv icon

Chaining Value Functions for Off-Policy Learning

Feb 02, 2022
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt

Figure 1 for Chaining Value Functions for Off-Policy Learning
Figure 2 for Chaining Value Functions for Off-Policy Learning
Figure 3 for Chaining Value Functions for Off-Policy Learning
Figure 4 for Chaining Value Functions for Off-Policy Learning
Viaarxiv icon

Learning and Planning in Complex Action Spaces

Apr 13, 2021
Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Mohammadamin Barekatain, Simon Schmitt, David Silver

Figure 1 for Learning and Planning in Complex Action Spaces
Figure 2 for Learning and Planning in Complex Action Spaces
Figure 3 for Learning and Planning in Complex Action Spaces
Figure 4 for Learning and Planning in Complex Action Spaces
Viaarxiv icon

Muesli: Combining Improvements in Policy Optimization

Apr 13, 2021
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt

Figure 1 for Muesli: Combining Improvements in Policy Optimization
Figure 2 for Muesli: Combining Improvements in Policy Optimization
Figure 3 for Muesli: Combining Improvements in Policy Optimization
Figure 4 for Muesli: Combining Improvements in Policy Optimization
Viaarxiv icon

AlgebraNets

Jun 16, 2020
Jordan Hoffmann, Simon Schmitt, Simon Osindero, Karen Simonyan, Erich Elsen

Figure 1 for AlgebraNets
Figure 2 for AlgebraNets
Figure 3 for AlgebraNets
Figure 4 for AlgebraNets
Viaarxiv icon

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Nov 19, 2019
Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy Lillicrap, David Silver

Figure 1 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 2 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 3 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 4 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Viaarxiv icon

Gated Linear Networks

Sep 30, 2019
Joel Veness, Tor Lattimore, Avishkar Bhoopchand, David Budden, Christopher Mattern, Agnieszka Grabska-Barwinska, Peter Toth, Simon Schmitt, Marcus Hutter

Figure 1 for Gated Linear Networks
Figure 2 for Gated Linear Networks
Figure 3 for Gated Linear Networks
Viaarxiv icon

Off-Policy Actor-Critic with Shared Experience Replay

Sep 25, 2019
Simon Schmitt, Matteo Hessel, Karen Simonyan

Figure 1 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 2 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 3 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 4 for Off-Policy Actor-Critic with Shared Experience Replay
Viaarxiv icon