Alert button
Picture for Tom Schaul

Tom Schaul

Alert button

Vision-Language Models as a Source of Rewards

Add code
Bookmark button
Alert button
Dec 14, 2023
Kate Baumli, Satinder Baveja, Feryal Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald, Luyu Wang, Lei Zhang

Viaarxiv icon

Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization

Add code
Bookmark button
Alert button
Apr 08, 2023
Robert Tjarko Lange, Tom Schaul, Yutian Chen, Chris Lu, Tom Zahavy, Valentin Dalibard, Sebastian Flennerhag

Figure 1 for Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization
Figure 2 for Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization
Figure 3 for Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization
Figure 4 for Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization
Viaarxiv icon

Scaling Goal-based Exploration via Pruning Proto-goals

Add code
Bookmark button
Alert button
Feb 09, 2023
Akhil Bagaria, Ray Jiang, Ramana Kumar, Tom Schaul

Figure 1 for Scaling Goal-based Exploration via Pruning Proto-goals
Figure 2 for Scaling Goal-based Exploration via Pruning Proto-goals
Figure 3 for Scaling Goal-based Exploration via Pruning Proto-goals
Figure 4 for Scaling Goal-based Exploration via Pruning Proto-goals
Viaarxiv icon

Discovering Evolution Strategies via Meta-Black-Box Optimization

Add code
Bookmark button
Alert button
Nov 25, 2022
Robert Tjarko Lange, Tom Schaul, Yutian Chen, Tom Zahavy, Valentin Dallibard, Chris Lu, Satinder Singh, Sebastian Flennerhag

Figure 1 for Discovering Evolution Strategies via Meta-Black-Box Optimization
Figure 2 for Discovering Evolution Strategies via Meta-Black-Box Optimization
Figure 3 for Discovering Evolution Strategies via Meta-Black-Box Optimization
Figure 4 for Discovering Evolution Strategies via Meta-Black-Box Optimization
Viaarxiv icon

The Phenomenon of Policy Churn

Add code
Bookmark button
Alert button
Jun 09, 2022
Tom Schaul, André Barreto, John Quan, Georg Ostrovski

Figure 1 for The Phenomenon of Policy Churn
Figure 2 for The Phenomenon of Policy Churn
Figure 3 for The Phenomenon of Policy Churn
Figure 4 for The Phenomenon of Policy Churn
Viaarxiv icon

Model-Value Inconsistency as a Signal for Epistemic Uncertainty

Add code
Bookmark button
Alert button
Dec 08, 2021
Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram Friesen, Feryal Behbahani, Tom Schaul, André Barreto, Simon Osindero

Figure 1 for Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Figure 2 for Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Figure 3 for Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Figure 4 for Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Viaarxiv icon

When should agents explore?

Add code
Bookmark button
Alert button
Aug 26, 2021
Miruna Pîslar, David Szepesvari, Georg Ostrovski, Diana Borsa, Tom Schaul

Figure 1 for When should agents explore?
Figure 2 for When should agents explore?
Figure 3 for When should agents explore?
Figure 4 for When should agents explore?
Viaarxiv icon

Return-based Scaling: Yet Another Normalisation Trick for Deep RL

Add code
Bookmark button
Alert button
May 11, 2021
Tom Schaul, Georg Ostrovski, Iurii Kemaev, Diana Borsa

Figure 1 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 2 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 3 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 4 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Viaarxiv icon

Policy Evaluation Networks

Add code
Bookmark button
Alert button
Feb 26, 2020
Jean Harb, Tom Schaul, Doina Precup, Pierre-Luc Bacon

Figure 1 for Policy Evaluation Networks
Figure 2 for Policy Evaluation Networks
Figure 3 for Policy Evaluation Networks
Figure 4 for Policy Evaluation Networks
Viaarxiv icon