Picture for Dale Schuurmans

Dale Schuurmans

University of Alberta

Leveraging Non-uniformity in First-order Non-convex Optimization

Add code
May 13, 2021
Figure 1 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 2 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 3 for Leveraging Non-uniformity in First-order Non-convex Optimization
Figure 4 for Leveraging Non-uniformity in First-order Non-convex Optimization
Viaarxiv icon

Joint Attention for Multi-Agent Coordination and Social Learning

Add code
Apr 15, 2021
Figure 1 for Joint Attention for Multi-Agent Coordination and Social Learning
Figure 2 for Joint Attention for Multi-Agent Coordination and Social Learning
Figure 3 for Joint Attention for Multi-Agent Coordination and Social Learning
Figure 4 for Joint Attention for Multi-Agent Coordination and Social Learning
Viaarxiv icon

On the Optimality of Batch Policy Optimization Algorithms

Add code
Apr 06, 2021
Figure 1 for On the Optimality of Batch Policy Optimization Algorithms
Figure 2 for On the Optimality of Batch Policy Optimization Algorithms
Viaarxiv icon

Optimization Issues in KL-Constrained Approximate Policy Iteration

Add code
Feb 11, 2021
Figure 1 for Optimization Issues in KL-Constrained Approximate Policy Iteration
Figure 2 for Optimization Issues in KL-Constrained Approximate Policy Iteration
Figure 3 for Optimization Issues in KL-Constrained Approximate Policy Iteration
Figure 4 for Optimization Issues in KL-Constrained Approximate Policy Iteration
Viaarxiv icon

Offline Policy Selection under Uncertainty

Add code
Dec 12, 2020
Figure 1 for Offline Policy Selection under Uncertainty
Figure 2 for Offline Policy Selection under Uncertainty
Figure 3 for Offline Policy Selection under Uncertainty
Figure 4 for Offline Policy Selection under Uncertainty
Viaarxiv icon

Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration

Add code
Nov 10, 2020
Figure 1 for Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration
Figure 2 for Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration
Figure 3 for Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration
Figure 4 for Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration
Viaarxiv icon

CoinDICE: Off-Policy Confidence Interval Estimation

Add code
Oct 22, 2020
Figure 1 for CoinDICE: Off-Policy Confidence Interval Estimation
Figure 2 for CoinDICE: Off-Policy Confidence Interval Estimation
Figure 3 for CoinDICE: Off-Policy Confidence Interval Estimation
Viaarxiv icon

Attention that does not Explain Away

Add code
Sep 29, 2020
Figure 1 for Attention that does not Explain Away
Figure 2 for Attention that does not Explain Away
Figure 3 for Attention that does not Explain Away
Figure 4 for Attention that does not Explain Away
Viaarxiv icon

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL

Add code
Jul 21, 2020
Figure 1 for EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Figure 2 for EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Figure 3 for EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Figure 4 for EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Viaarxiv icon

Off-Policy Evaluation via the Regularized Lagrangian

Add code
Jul 07, 2020
Figure 1 for Off-Policy Evaluation via the Regularized Lagrangian
Figure 2 for Off-Policy Evaluation via the Regularized Lagrangian
Figure 3 for Off-Policy Evaluation via the Regularized Lagrangian
Figure 4 for Off-Policy Evaluation via the Regularized Lagrangian
Viaarxiv icon