Doina Precup

Avoidance Learning Using Observational Reinforcement Learning

Sep 24, 2019
David Venuto, Leonard Boussioux, Junhao Wang, Rola Dali, Jhelum Chakravorty, Yoshua Bengio, Doina Precup

Revisit Policy Optimization in Matrix Form

Sep 19, 2019
Sitao Luan, Xiao-Wen Chang, Doina Precup

An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation

Jul 31, 2019
Vincent Michalski, Vikram Voleti, Samira Ebrahimi Kahou, Anthony Ortiz, Pascal Vincent, Chris Pal, Doina Precup

Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning

Jul 05, 2019
Srinivas Venkattaramanujam, Eric Crawford, Thang Doan, Doina Precup

Neural Transfer Learning for Cry-based Diagnosis of Perinatal Asphyxia

Jul 02, 2019
Charles C. Onu, Jonathan Lebensold, William L. Hamilton, Doina Precup

Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks

Jun 21, 2019
Sitao Luan, Mingde Zhao, Xiao-Wen Chang, Doina Precup

SVRG for Policy Evaluation with Fewer Gradient Evaluations

Jun 09, 2019
Zilun Peng, Ahmed Touati, Pascal Vincent, Doina Precup

Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization

May 25, 2019
Mingde Zhao, Ian Porada, Sitao Luan, Xiaowen Chang, Doina Precup

Recurrent Value Functions

May 23, 2019
Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup, Joelle Pineau
