Alert button
Picture for Doina Precup

Doina Precup

Alert button

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation

Add code
Bookmark button
Alert button
Jun 12, 2021
Scott Fujimoto, David Meger, Doina Precup

Figure 1 for A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Figure 2 for A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Figure 3 for A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Figure 4 for A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Viaarxiv icon

Preferential Temporal Difference Learning

Add code
Bookmark button
Alert button
Jun 11, 2021
Nishanth Anand, Doina Precup

Figure 1 for Preferential Temporal Difference Learning
Figure 2 for Preferential Temporal Difference Learning
Figure 3 for Preferential Temporal Difference Learning
Figure 4 for Preferential Temporal Difference Learning
Viaarxiv icon

Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation

Add code
Bookmark button
Alert button
Jun 08, 2021
Emmanuel Bengio, Moksh Jain, Maksym Korablyov, Doina Precup, Yoshua Bengio

Figure 1 for Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Figure 2 for Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Figure 3 for Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Figure 4 for Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Viaarxiv icon

Correcting Momentum in Temporal Difference Learning

Add code
Bookmark button
Alert button
Jun 07, 2021
Emmanuel Bengio, Joelle Pineau, Doina Precup

Figure 1 for Correcting Momentum in Temporal Difference Learning
Figure 2 for Correcting Momentum in Temporal Difference Learning
Figure 3 for Correcting Momentum in Temporal Difference Learning
Figure 4 for Correcting Momentum in Temporal Difference Learning
Viaarxiv icon

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 03, 2021
Mingde Zhao, Zhen Liu, Sitao Luan, Shuyuan Zhang, Doina Precup, Yoshua Bengio

Figure 1 for A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
Figure 2 for A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
Figure 3 for A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
Figure 4 for A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
Viaarxiv icon

Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL

Add code
Bookmark button
Alert button
Jun 01, 2021
Bogdan Mazoure, Paul Mineiro, Pavithra Srinath, Reza Sharifi Sedeh, Doina Precup, Adith Swaminathan

Figure 1 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 2 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 3 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 4 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Viaarxiv icon

AndroidEnv: A Reinforcement Learning Platform for Android

Add code
Bookmark button
Alert button
May 27, 2021
Daniel Toyama, Philippe Hamel, Anita Gergely, Gheorghe Comanici, Amelia Glaese, Zafarali Ahmed, Tyler Jackson, Shibl Mourad, Doina Precup

Figure 1 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 2 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 3 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 4 for AndroidEnv: A Reinforcement Learning Platform for Android
Viaarxiv icon

What is Going on Inside Recurrent Meta Reinforcement Learning Agents?

Add code
Bookmark button
Alert button
Apr 29, 2021
Safa Alver, Doina Precup

Figure 1 for What is Going on Inside Recurrent Meta Reinforcement Learning Agents?
Figure 2 for What is Going on Inside Recurrent Meta Reinforcement Learning Agents?
Viaarxiv icon

Training a First-Order Theorem Prover from Synthetic Data

Add code
Bookmark button
Alert button
Mar 05, 2021
Vlad Firoiu, Eser Aygun, Ankit Anand, Zafarali Ahmed, Xavier Glorot, Laurent Orseau, Lei Zhang, Doina Precup, Shibl Mourad

Figure 1 for Training a First-Order Theorem Prover from Synthetic Data
Figure 2 for Training a First-Order Theorem Prover from Synthetic Data
Figure 3 for Training a First-Order Theorem Prover from Synthetic Data
Figure 4 for Training a First-Order Theorem Prover from Synthetic Data
Viaarxiv icon

Variance Penalized On-Policy and Off-Policy Actor-Critic

Add code
Bookmark button
Alert button
Feb 03, 2021
Arushi Jain, Gandharv Patil, Ayush Jain, Khimya Khetarpal, Doina Precup

Figure 1 for Variance Penalized On-Policy and Off-Policy Actor-Critic
Figure 2 for Variance Penalized On-Policy and Off-Policy Actor-Critic
Figure 3 for Variance Penalized On-Policy and Off-Policy Actor-Critic
Figure 4 for Variance Penalized On-Policy and Off-Policy Actor-Critic
Viaarxiv icon