John Aslanides

Behaviour Suite for Reinforcement Learning

Aug 09, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvári, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado van Hasselt

Via arXiv

When to use parametric models in reinforcement learning?

Jun 12, 2019
Hado van Hasselt, Matteo Hessel, John Aslanides

Via arXiv

TF-Replicator: Distributed Machine Learning for Researchers

Feb 01, 2019
Peter Buchlovsky, David Budden, Dominik Grewe, Chris Jones, John Aslanides, Frederic Besse, Andy Brock, Aidan Clark, Sergio Gómez Colmenarejo, Aedan Pope, Fabio Viola, Dan Belov

Via arXiv

Randomized Prior Functions for Deep Reinforcement Learning

Jun 08, 2018
Ian Osband, John Aslanides, Albin Cassirer

Via arXiv

Universal Reinforcement Learning Algorithms: Survey and Experiments

May 30, 2017
John Aslanides, Jan Leike, Marcus Hutter

Via arXiv

AIXIjs: A Software Demo for General Reinforcement Learning

May 22, 2017
John Aslanides

Via arXiv

Generalised Discount Functions applied to a Monte-Carlo AImu Implementation

Mar 03, 2017
Sean Lamont, John Aslanides, Jan Leike, Marcus Hutter

Via arXiv