Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

The Phenomenon of Policy Churn


Jun 09, 2022
Tom Schaul, André Barreto, John Quan, Georg Ostrovski


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Model-Value Inconsistency as a Signal for Epistemic Uncertainty


Dec 08, 2021
Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram Friesen, Feryal Behbahani, Tom Schaul, André Barreto, Simon Osindero

* The first three authors contributed equally 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

When should agents explore?


Aug 26, 2021
Miruna Pîslar, David Szepesvari, Georg Ostrovski, Diana Borsa, Tom Schaul


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Return-based Scaling: Yet Another Normalisation Trick for Deep RL


May 11, 2021
Tom Schaul, Georg Ostrovski, Iurii Kemaev, Diana Borsa


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Policy Evaluation Networks


Feb 26, 2020
Jean Harb, Tom Schaul, Doina Precup, Pierre-Luc Bacon

* 12 pages, 11 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Adapting Behaviour for Learning Progress


Dec 14, 2019
Tom Schaul, Diana Borsa, David Ding, David Szepesvari, Georg Ostrovski, Will Dabney, Simon Osindero


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Conditional Importance Sampling for Off-Policy Learning


Oct 16, 2019
Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, RĂ©mi Munos, Will Dabney


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods


Jun 07, 2019
Karel Lenc, Erich Elsen, Tom Schaul, Karen Simonyan


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Ray Interference: a Source of Plateaus in Deep Reinforcement Learning


Apr 25, 2019
Tom Schaul, Diana Borsa, Joseph Modayil, Razvan Pascanu

* Full version of RLDM abstract 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement


Jan 30, 2019
André Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel Mankowitz, Augustin Žídek, Rémi Munos

* Published at ICML 2018 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
4
>>