Alert button
Picture for Simon Schmitt

Simon Schmitt

Alert button

Exploration via Epistemic Value Estimation

Add code
Bookmark button
Alert button
Mar 07, 2023
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt

Figure 1 for Exploration via Epistemic Value Estimation
Figure 2 for Exploration via Epistemic Value Estimation
Figure 3 for Exploration via Epistemic Value Estimation
Figure 4 for Exploration via Epistemic Value Estimation
Viaarxiv icon

Chaining Value Functions for Off-Policy Learning

Add code
Bookmark button
Alert button
Feb 02, 2022
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt

Figure 1 for Chaining Value Functions for Off-Policy Learning
Figure 2 for Chaining Value Functions for Off-Policy Learning
Figure 3 for Chaining Value Functions for Off-Policy Learning
Figure 4 for Chaining Value Functions for Off-Policy Learning
Viaarxiv icon

Learning and Planning in Complex Action Spaces

Add code
Bookmark button
Alert button
Apr 13, 2021
Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Mohammadamin Barekatain, Simon Schmitt, David Silver

Figure 1 for Learning and Planning in Complex Action Spaces
Figure 2 for Learning and Planning in Complex Action Spaces
Figure 3 for Learning and Planning in Complex Action Spaces
Figure 4 for Learning and Planning in Complex Action Spaces
Viaarxiv icon

Muesli: Combining Improvements in Policy Optimization

Add code
Bookmark button
Alert button
Apr 13, 2021
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt

Figure 1 for Muesli: Combining Improvements in Policy Optimization
Figure 2 for Muesli: Combining Improvements in Policy Optimization
Figure 3 for Muesli: Combining Improvements in Policy Optimization
Figure 4 for Muesli: Combining Improvements in Policy Optimization
Viaarxiv icon

AlgebraNets

Add code
Bookmark button
Alert button
Jun 16, 2020
Jordan Hoffmann, Simon Schmitt, Simon Osindero, Karen Simonyan, Erich Elsen

Figure 1 for AlgebraNets
Figure 2 for AlgebraNets
Figure 3 for AlgebraNets
Figure 4 for AlgebraNets
Viaarxiv icon

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Add code
Bookmark button
Alert button
Nov 19, 2019
Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy Lillicrap, David Silver

Figure 1 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 2 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 3 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 4 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Viaarxiv icon

Gated Linear Networks

Add code
Bookmark button
Alert button
Sep 30, 2019
Joel Veness, Tor Lattimore, Avishkar Bhoopchand, David Budden, Christopher Mattern, Agnieszka Grabska-Barwinska, Peter Toth, Simon Schmitt, Marcus Hutter

Figure 1 for Gated Linear Networks
Figure 2 for Gated Linear Networks
Figure 3 for Gated Linear Networks
Viaarxiv icon

Off-Policy Actor-Critic with Shared Experience Replay

Add code
Bookmark button
Alert button
Sep 25, 2019
Simon Schmitt, Matteo Hessel, Karen Simonyan

Figure 1 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 2 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 3 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 4 for Off-Policy Actor-Critic with Shared Experience Replay
Viaarxiv icon