Picture for Ofir Nachum

Ofir Nachum

Tony

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Add code
Jul 02, 2020
Figure 1 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 2 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 3 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 4 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Viaarxiv icon

Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization

Add code
Jun 23, 2020
Figure 1 for Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
Figure 2 for Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
Figure 3 for Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
Figure 4 for Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
Viaarxiv icon

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

Add code
Apr 20, 2020
Figure 1 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Figure 2 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Figure 3 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Viaarxiv icon

BRPO: Batch Residual Policy Optimization

Add code
Feb 08, 2020
Figure 1 for BRPO: Batch Residual Policy Optimization
Figure 2 for BRPO: Batch Residual Policy Optimization
Figure 3 for BRPO: Batch Residual Policy Optimization
Figure 4 for BRPO: Batch Residual Policy Optimization
Viaarxiv icon

Reinforcement Learning via Fenchel-Rockafellar Duality

Add code
Jan 09, 2020
Figure 1 for Reinforcement Learning via Fenchel-Rockafellar Duality
Viaarxiv icon

Imitation Learning via Off-Policy Distribution Matching

Add code
Dec 10, 2019
Figure 1 for Imitation Learning via Off-Policy Distribution Matching
Figure 2 for Imitation Learning via Off-Policy Distribution Matching
Figure 3 for Imitation Learning via Off-Policy Distribution Matching
Figure 4 for Imitation Learning via Off-Policy Distribution Matching
Viaarxiv icon

AlgaeDICE: Policy Gradient from Arbitrary Experience

Add code
Dec 04, 2019
Figure 1 for AlgaeDICE: Policy Gradient from Arbitrary Experience
Figure 2 for AlgaeDICE: Policy Gradient from Arbitrary Experience
Figure 3 for AlgaeDICE: Policy Gradient from Arbitrary Experience
Figure 4 for AlgaeDICE: Policy Gradient from Arbitrary Experience
Viaarxiv icon

Behavior Regularized Offline Reinforcement Learning

Add code
Nov 26, 2019
Figure 1 for Behavior Regularized Offline Reinforcement Learning
Figure 2 for Behavior Regularized Offline Reinforcement Learning
Figure 3 for Behavior Regularized Offline Reinforcement Learning
Figure 4 for Behavior Regularized Offline Reinforcement Learning
Viaarxiv icon

Group-based Fair Learning Leads to Counter-intuitive Predictions

Add code
Oct 04, 2019
Figure 1 for Group-based Fair Learning Leads to Counter-intuitive Predictions
Figure 2 for Group-based Fair Learning Leads to Counter-intuitive Predictions
Figure 3 for Group-based Fair Learning Leads to Counter-intuitive Predictions
Figure 4 for Group-based Fair Learning Leads to Counter-intuitive Predictions
Viaarxiv icon

Why Does Hierarchy Work So Well in Reinforcement Learning?

Add code
Sep 23, 2019
Figure 1 for Why Does Hierarchy  Work So Well in Reinforcement Learning?
Figure 2 for Why Does Hierarchy  Work So Well in Reinforcement Learning?
Figure 3 for Why Does Hierarchy  Work So Well in Reinforcement Learning?
Figure 4 for Why Does Hierarchy  Work So Well in Reinforcement Learning?
Viaarxiv icon