Alert button
Picture for Georg Ostrovski

Georg Ostrovski

Alert button

Deep Reinforcement Learning with Plasticity Injection

Add code
Bookmark button
Alert button
May 24, 2023
Evgenii Nikishin, Junhyuk Oh, Georg Ostrovski, Clare Lyle, Razvan Pascanu, Will Dabney, André Barreto

Figure 1 for Deep Reinforcement Learning with Plasticity Injection
Figure 2 for Deep Reinforcement Learning with Plasticity Injection
Figure 3 for Deep Reinforcement Learning with Plasticity Injection
Figure 4 for Deep Reinforcement Learning with Plasticity Injection
Viaarxiv icon

An Analysis of Quantile Temporal-Difference Learning

Add code
Bookmark button
Alert button
Jan 11, 2023
Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski, Anna Harutyunyan, Karl Tuyls, Marc G. Bellemare, Will Dabney

Figure 1 for An Analysis of Quantile Temporal-Difference Learning
Figure 2 for An Analysis of Quantile Temporal-Difference Learning
Figure 3 for An Analysis of Quantile Temporal-Difference Learning
Figure 4 for An Analysis of Quantile Temporal-Difference Learning
Viaarxiv icon

An Empirical Study of Implicit Regularization in Deep Offline RL

Add code
Bookmark button
Alert button
Jul 07, 2022
Caglar Gulcehre, Srivatsan Srinivasan, Jakub Sygnowski, Georg Ostrovski, Mehrdad Farajtabar, Matt Hoffman, Razvan Pascanu, Arnaud Doucet

Figure 1 for An Empirical Study of Implicit Regularization in Deep Offline RL
Figure 2 for An Empirical Study of Implicit Regularization in Deep Offline RL
Figure 3 for An Empirical Study of Implicit Regularization in Deep Offline RL
Figure 4 for An Empirical Study of Implicit Regularization in Deep Offline RL
Viaarxiv icon

The Phenomenon of Policy Churn

Add code
Bookmark button
Alert button
Jun 09, 2022
Tom Schaul, André Barreto, John Quan, Georg Ostrovski

Figure 1 for The Phenomenon of Policy Churn
Figure 2 for The Phenomenon of Policy Churn
Figure 3 for The Phenomenon of Policy Churn
Figure 4 for The Phenomenon of Policy Churn
Viaarxiv icon

The Difficulty of Passive Learning in Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 26, 2021
Georg Ostrovski, Pablo Samuel Castro, Will Dabney

Figure 1 for The Difficulty of Passive Learning in Deep Reinforcement Learning
Figure 2 for The Difficulty of Passive Learning in Deep Reinforcement Learning
Figure 3 for The Difficulty of Passive Learning in Deep Reinforcement Learning
Figure 4 for The Difficulty of Passive Learning in Deep Reinforcement Learning
Viaarxiv icon

When should agents explore?

Add code
Bookmark button
Alert button
Aug 26, 2021
Miruna Pîslar, David Szepesvari, Georg Ostrovski, Diana Borsa, Tom Schaul

Figure 1 for When should agents explore?
Figure 2 for When should agents explore?
Figure 3 for When should agents explore?
Figure 4 for When should agents explore?
Viaarxiv icon

Return-based Scaling: Yet Another Normalisation Trick for Deep RL

Add code
Bookmark button
Alert button
May 11, 2021
Tom Schaul, Georg Ostrovski, Iurii Kemaev, Diana Borsa

Figure 1 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 2 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 3 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 4 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Viaarxiv icon

On The Effect of Auxiliary Tasks on Representation Dynamics

Add code
Bookmark button
Alert button
Feb 25, 2021
Clare Lyle, Mark Rowland, Georg Ostrovski, Will Dabney

Figure 1 for On The Effect of Auxiliary Tasks on Representation Dynamics
Figure 2 for On The Effect of Auxiliary Tasks on Representation Dynamics
Figure 3 for On The Effect of Auxiliary Tasks on Representation Dynamics
Figure 4 for On The Effect of Auxiliary Tasks on Representation Dynamics
Viaarxiv icon

Temporally-Extended ε-Greedy Exploration

Add code
Bookmark button
Alert button
Jun 02, 2020
Will Dabney, Georg Ostrovski, André Barreto

Figure 1 for Temporally-Extended ε-Greedy Exploration
Figure 2 for Temporally-Extended ε-Greedy Exploration
Figure 3 for Temporally-Extended ε-Greedy Exploration
Figure 4 for Temporally-Extended ε-Greedy Exploration
Viaarxiv icon

Adapting Behaviour for Learning Progress

Add code
Bookmark button
Alert button
Dec 14, 2019
Tom Schaul, Diana Borsa, David Ding, David Szepesvari, Georg Ostrovski, Will Dabney, Simon Osindero

Figure 1 for Adapting Behaviour for Learning Progress
Figure 2 for Adapting Behaviour for Learning Progress
Figure 3 for Adapting Behaviour for Learning Progress
Figure 4 for Adapting Behaviour for Learning Progress
Viaarxiv icon