Alert button
Picture for Doina Precup

Doina Precup

Alert button

Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 11, 2019
Riashat Islam, Raihan Seraj, Samin Yeasar Arnob, Doina Precup

Figure 1 for Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning
Figure 2 for Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning
Figure 3 for Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning
Viaarxiv icon

Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods

Add code
Bookmark button
Alert button
Dec 11, 2019
Riashat Islam, Raihan Seraj, Pierre-Luc Bacon, Doina Precup

Figure 1 for Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Figure 2 for Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Figure 3 for Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Figure 4 for Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Viaarxiv icon

Hindsight Credit Assignment

Add code
Bookmark button
Alert button
Dec 05, 2019
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Remi Munos

Figure 1 for Hindsight Credit Assignment
Figure 2 for Hindsight Credit Assignment
Figure 3 for Hindsight Credit Assignment
Figure 4 for Hindsight Credit Assignment
Viaarxiv icon

Option-critic in cooperative multi-agent systems

Add code
Bookmark button
Alert button
Nov 28, 2019
Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu, Doina Precup

Figure 1 for Option-critic in cooperative multi-agent systems
Figure 2 for Option-critic in cooperative multi-agent systems
Figure 3 for Option-critic in cooperative multi-agent systems
Figure 4 for Option-critic in cooperative multi-agent systems
Viaarxiv icon

Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction

Add code
Bookmark button
Alert button
Nov 28, 2019
Vishal Jain, William Fedus, Hugo Larochelle, Doina Precup, Marc G. Bellemare

Figure 1 for Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction
Figure 2 for Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction
Figure 3 for Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction
Figure 4 for Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction
Viaarxiv icon

Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning

Add code
Bookmark button
Alert button
Nov 22, 2019
Tianyu Li, Bogdan Mazoure, Doina Precup, Guillaume Rabusseau

Figure 1 for Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Figure 2 for Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Figure 3 for Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Viaarxiv icon

Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments

Add code
Bookmark button
Alert button
Oct 29, 2019
Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira E. Kahou, Joseph P. Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo, Chris Pal

Figure 1 for Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Figure 2 for Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Figure 3 for Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Figure 4 for Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Viaarxiv icon

Actor Critic with Differentially Private Critic

Add code
Bookmark button
Alert button
Oct 14, 2019
Jonathan Lebensold, William Hamilton, Borja Balle, Doina Precup

Figure 1 for Actor Critic with Differentially Private Critic
Figure 2 for Actor Critic with Differentially Private Critic
Viaarxiv icon

Augmenting learning using symmetry in a biologically-inspired domain

Add code
Bookmark button
Alert button
Oct 01, 2019
Shruti Mishra, Abbas Abdolmaleki, Arthur Guez, Piotr Trochim, Doina Precup

Figure 1 for Augmenting learning using symmetry in a biologically-inspired domain
Figure 2 for Augmenting learning using symmetry in a biologically-inspired domain
Viaarxiv icon