Picture for Nando de Freitas

Nando de Freitas

University of British Columbia

On Instrumental Variable Regression for Deep Offline Policy Evaluation

Add code
May 21, 2021
Figure 1 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 2 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 3 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 4 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Viaarxiv icon

Regularized Behavior Value Estimation

Add code
Mar 17, 2021
Figure 1 for Regularized Behavior Value Estimation
Figure 2 for Regularized Behavior Value Estimation
Figure 3 for Regularized Behavior Value Estimation
Figure 4 for Regularized Behavior Value Estimation
Viaarxiv icon

Semi-supervised reward learning for offline reinforcement learning

Add code
Dec 12, 2020
Figure 1 for Semi-supervised reward learning for offline reinforcement learning
Figure 2 for Semi-supervised reward learning for offline reinforcement learning
Figure 3 for Semi-supervised reward learning for offline reinforcement learning
Figure 4 for Semi-supervised reward learning for offline reinforcement learning
Viaarxiv icon

Offline Learning from Demonstrations and Unlabeled Experience

Add code
Nov 27, 2020
Figure 1 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 2 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 3 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 4 for Offline Learning from Demonstrations and Unlabeled Experience
Viaarxiv icon

Large-scale multilingual audio visual dubbing

Add code
Nov 06, 2020
Figure 1 for Large-scale multilingual audio visual dubbing
Figure 2 for Large-scale multilingual audio visual dubbing
Figure 3 for Large-scale multilingual audio visual dubbing
Figure 4 for Large-scale multilingual audio visual dubbing
Viaarxiv icon

Learning Deep Features in Instrumental Variable Regression

Add code
Nov 01, 2020
Figure 1 for Learning Deep Features in Instrumental Variable Regression
Figure 2 for Learning Deep Features in Instrumental Variable Regression
Figure 3 for Learning Deep Features in Instrumental Variable Regression
Figure 4 for Learning Deep Features in Instrumental Variable Regression
Viaarxiv icon

Learning Compositional Neural Programs for Continuous Control

Add code
Jul 27, 2020
Figure 1 for Learning Compositional Neural Programs for Continuous Control
Figure 2 for Learning Compositional Neural Programs for Continuous Control
Figure 3 for Learning Compositional Neural Programs for Continuous Control
Figure 4 for Learning Compositional Neural Programs for Continuous Control
Viaarxiv icon

Hyperparameter Selection for Offline Reinforcement Learning

Add code
Jul 17, 2020
Figure 1 for Hyperparameter Selection for Offline Reinforcement Learning
Figure 2 for Hyperparameter Selection for Offline Reinforcement Learning
Figure 3 for Hyperparameter Selection for Offline Reinforcement Learning
Figure 4 for Hyperparameter Selection for Offline Reinforcement Learning
Viaarxiv icon

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Add code
Jul 02, 2020
Figure 1 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 2 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 3 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 4 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Viaarxiv icon

Critic Regularized Regression

Add code
Jun 26, 2020
Figure 1 for Critic Regularized Regression
Figure 2 for Critic Regularized Regression
Figure 3 for Critic Regularized Regression
Figure 4 for Critic Regularized Regression
Viaarxiv icon