Alert button
Picture for Diana Borsa

Diana Borsa

Alert button

A State Representation for Diminishing Rewards

Add code
Bookmark button
Alert button
Sep 07, 2023
Ted Moskovitz, Samo Hromadka, Ahmed Touati, Diana Borsa, Maneesh Sahani

Figure 1 for A State Representation for Diminishing Rewards
Figure 2 for A State Representation for Diminishing Rewards
Figure 3 for A State Representation for Diminishing Rewards
Figure 4 for A State Representation for Diminishing Rewards
Viaarxiv icon

Generalised Policy Improvement with Geometric Policy Composition

Add code
Bookmark button
Alert button
Jun 17, 2022
Shantanu Thakoor, Mark Rowland, Diana Borsa, Will Dabney, Rémi Munos, André Barreto

Figure 1 for Generalised Policy Improvement with Geometric Policy Composition
Figure 2 for Generalised Policy Improvement with Geometric Policy Composition
Figure 3 for Generalised Policy Improvement with Geometric Policy Composition
Figure 4 for Generalised Policy Improvement with Geometric Policy Composition
Viaarxiv icon

Selective Credit Assignment

Add code
Bookmark button
Alert button
Feb 20, 2022
Veronica Chelu, Diana Borsa, Doina Precup, Hado van Hasselt

Figure 1 for Selective Credit Assignment
Figure 2 for Selective Credit Assignment
Figure 3 for Selective Credit Assignment
Figure 4 for Selective Credit Assignment
Viaarxiv icon

Model-Value Inconsistency as a Signal for Epistemic Uncertainty

Add code
Bookmark button
Alert button
Dec 08, 2021
Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram Friesen, Feryal Behbahani, Tom Schaul, André Barreto, Simon Osindero

Figure 1 for Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Figure 2 for Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Figure 3 for Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Figure 4 for Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Viaarxiv icon

When should agents explore?

Add code
Bookmark button
Alert button
Aug 26, 2021
Miruna Pîslar, David Szepesvari, Georg Ostrovski, Diana Borsa, Tom Schaul

Figure 1 for When should agents explore?
Figure 2 for When should agents explore?
Figure 3 for When should agents explore?
Figure 4 for When should agents explore?
Viaarxiv icon

The Option Keyboard: Combining Skills in Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 24, 2021
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan Hunt, Shibl Mourad, David Silver, Doina Precup

Figure 1 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 2 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 3 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 4 for The Option Keyboard: Combining Skills in Reinforcement Learning
Viaarxiv icon

Return-based Scaling: Yet Another Normalisation Trick for Deep RL

Add code
Bookmark button
Alert button
May 11, 2021
Tom Schaul, Georg Ostrovski, Iurii Kemaev, Diana Borsa

Figure 1 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 2 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 3 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Figure 4 for Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Viaarxiv icon

Expected Eligibility Traces

Add code
Bookmark button
Alert button
Jul 03, 2020
Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa

Figure 1 for Expected Eligibility Traces
Figure 2 for Expected Eligibility Traces
Figure 3 for Expected Eligibility Traces
Figure 4 for Expected Eligibility Traces
Viaarxiv icon

Adapting Behaviour for Learning Progress

Add code
Bookmark button
Alert button
Dec 14, 2019
Tom Schaul, Diana Borsa, David Ding, David Szepesvari, Georg Ostrovski, Will Dabney, Simon Osindero

Figure 1 for Adapting Behaviour for Learning Progress
Figure 2 for Adapting Behaviour for Learning Progress
Figure 3 for Adapting Behaviour for Learning Progress
Figure 4 for Adapting Behaviour for Learning Progress
Viaarxiv icon

Conditional Importance Sampling for Off-Policy Learning

Add code
Bookmark button
Alert button
Oct 16, 2019
Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney

Figure 1 for Conditional Importance Sampling for Off-Policy Learning
Figure 2 for Conditional Importance Sampling for Off-Policy Learning
Figure 3 for Conditional Importance Sampling for Off-Policy Learning
Figure 4 for Conditional Importance Sampling for Off-Policy Learning
Viaarxiv icon