Dale Schuurmans

Scalable Deep Generative Modeling for Sparse Graphs

Jun 28, 2020
Hanjun Dai, Azade Nazi, Yujia Li, Bo Dai, Dale Schuurmans

A maximum-entropy approach to off-policy evaluation in average-reward MDPs

Jun 17, 2020
Nevena Lazic, Dong Yin, Mehrdad Farajtabar, Nir Levine, Dilan Gorur, Chris Harris, Dale Schuurmans

On the Global Convergence Rates of Softmax Policy Gradient Methods

May 13, 2020
Jincheng Mei, Chenjun Xiao, Csaba Szepesvari, Dale Schuurmans

Energy-Based Processes for Exchangeable Data

Mar 17, 2020
Mengjiao Yang, Bo Dai, Hanjun Dai, Dale Schuurmans

Variational Inference for Deep Probabilistic Canonical Correlation Analysis

Mar 09, 2020
Mahdi Karami, Dale Schuurmans

Batch Stationary Distribution Estimation

Mar 02, 2020
Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans

ConQUR: Mitigating Delusional Bias in Deep Q-learning

Feb 27, 2020
Andy Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier

GenDICE: Generalized Offline Estimation of Stationary Values

Feb 21, 2020
Ruiyi Zhang, Bo Dai, Lihong Li, Dale Schuurmans

Learning to Combat Compounding-Error in Model-Based Reinforcement Learning

Dec 24, 2019
Chenjun Xiao, Yifan Wu, Chen Ma, Dale Schuurmans, Martin Müller

AlgaeDICE: Policy Gradient from Arbitrary Experience

Dec 04, 2019
Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, Dale Schuurmans
