Picture for Doina Precup

Doina Precup

McGill University, Mila- Quebec Artificial Intelligence Institute

Learning to cooperate: Emergent communication in multi-agent navigation

Add code
Apr 02, 2020
Figure 1 for Learning to cooperate: Emergent communication in multi-agent navigation
Figure 2 for Learning to cooperate: Emergent communication in multi-agent navigation
Figure 3 for Learning to cooperate: Emergent communication in multi-agent navigation
Figure 4 for Learning to cooperate: Emergent communication in multi-agent navigation
Viaarxiv icon

A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

Add code
Mar 27, 2020
Figure 1 for A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms
Viaarxiv icon

Interference and Generalization in Temporal Difference Learning

Add code
Mar 13, 2020
Figure 1 for Interference and Generalization in Temporal Difference Learning
Figure 2 for Interference and Generalization in Temporal Difference Learning
Figure 3 for Interference and Generalization in Temporal Difference Learning
Figure 4 for Interference and Generalization in Temporal Difference Learning
Viaarxiv icon

Invariant Causal Prediction for Block MDPs

Add code
Mar 12, 2020
Figure 1 for Invariant Causal Prediction for Block MDPs
Figure 2 for Invariant Causal Prediction for Block MDPs
Figure 3 for Invariant Causal Prediction for Block MDPs
Figure 4 for Invariant Causal Prediction for Block MDPs
Viaarxiv icon

Policy Evaluation Networks

Add code
Feb 26, 2020
Figure 1 for Policy Evaluation Networks
Figure 2 for Policy Evaluation Networks
Figure 3 for Policy Evaluation Networks
Figure 4 for Policy Evaluation Networks
Viaarxiv icon

oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions

Add code
Feb 20, 2020
Figure 1 for oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions
Figure 2 for oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions
Figure 3 for oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions
Figure 4 for oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions
Viaarxiv icon

Value-driven Hindsight Modelling

Add code
Feb 19, 2020
Figure 1 for Value-driven Hindsight Modelling
Figure 2 for Value-driven Hindsight Modelling
Figure 3 for Value-driven Hindsight Modelling
Figure 4 for Value-driven Hindsight Modelling
Viaarxiv icon

Provably efficient reconstruction of policy networks

Add code
Feb 07, 2020
Figure 1 for Provably efficient reconstruction of policy networks
Figure 2 for Provably efficient reconstruction of policy networks
Figure 3 for Provably efficient reconstruction of policy networks
Figure 4 for Provably efficient reconstruction of policy networks
Viaarxiv icon

Option-critic in cooperative multi-agent systems

Add code
Jan 06, 2020
Figure 1 for Option-critic in cooperative multi-agent systems
Figure 2 for Option-critic in cooperative multi-agent systems
Figure 3 for Option-critic in cooperative multi-agent systems
Figure 4 for Option-critic in cooperative multi-agent systems
Viaarxiv icon

Options of Interest: Temporal Abstraction with Interest Functions

Add code
Jan 01, 2020
Figure 1 for Options of Interest: Temporal Abstraction with Interest Functions
Figure 2 for Options of Interest: Temporal Abstraction with Interest Functions
Figure 3 for Options of Interest: Temporal Abstraction with Interest Functions
Figure 4 for Options of Interest: Temporal Abstraction with Interest Functions
Viaarxiv icon