Alert button
Picture for Ofir Nachum

Ofir Nachum

Alert button

Representation Matters: Offline Pretraining for Sequential Decision Making

Add code
Bookmark button
Alert button
Feb 11, 2021
Mengjiao Yang, Ofir Nachum

Figure 1 for Representation Matters: Offline Pretraining for Sequential Decision Making
Figure 2 for Representation Matters: Offline Pretraining for Sequential Decision Making
Figure 3 for Representation Matters: Offline Pretraining for Sequential Decision Making
Figure 4 for Representation Matters: Offline Pretraining for Sequential Decision Making
Viaarxiv icon

Offline Policy Selection under Uncertainty

Add code
Bookmark button
Alert button
Dec 12, 2020
Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans

Figure 1 for Offline Policy Selection under Uncertainty
Figure 2 for Offline Policy Selection under Uncertainty
Figure 3 for Offline Policy Selection under Uncertainty
Figure 4 for Offline Policy Selection under Uncertainty
Viaarxiv icon

OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 27, 2020
Anurag Ajay, Aviral Kumar, Pulkit Agrawal, Sergey Levine, Ofir Nachum

Figure 1 for OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Figure 2 for OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Figure 3 for OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Figure 4 for OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Viaarxiv icon

CoinDICE: Off-Policy Confidence Interval Estimation

Add code
Bookmark button
Alert button
Oct 22, 2020
Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvári, Dale Schuurmans

Figure 1 for CoinDICE: Off-Policy Confidence Interval Estimation
Figure 2 for CoinDICE: Off-Policy Confidence Interval Estimation
Figure 3 for CoinDICE: Off-Policy Confidence Interval Estimation
Viaarxiv icon

Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation

Add code
Bookmark button
Alert button
Jul 27, 2020
Ilya Kostrikov, Ofir Nachum

Figure 1 for Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation
Figure 2 for Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation
Figure 3 for Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation
Viaarxiv icon

Off-Policy Evaluation via the Regularized Lagrangian

Add code
Bookmark button
Alert button
Jul 07, 2020
Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans

Figure 1 for Off-Policy Evaluation via the Regularized Lagrangian
Figure 2 for Off-Policy Evaluation via the Regularized Lagrangian
Figure 3 for Off-Policy Evaluation via the Regularized Lagrangian
Figure 4 for Off-Policy Evaluation via the Regularized Lagrangian
Viaarxiv icon

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

Figure 1 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 2 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 3 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 4 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Viaarxiv icon

Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization

Add code
Bookmark button
Alert button
Jun 23, 2020
Tatsuya Matsushima, Hiroki Furuta, Yutaka Matsuo, Ofir Nachum, Shixiang Gu

Figure 1 for Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
Figure 2 for Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
Figure 3 for Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
Figure 4 for Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
Viaarxiv icon

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 20, 2020
Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine

Figure 1 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Figure 2 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Figure 3 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Viaarxiv icon

BRPO: Batch Residual Policy Optimization

Add code
Bookmark button
Alert button
Feb 08, 2020
Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier

Figure 1 for BRPO: Batch Residual Policy Optimization
Figure 2 for BRPO: Batch Residual Policy Optimization
Figure 3 for BRPO: Batch Residual Policy Optimization
Figure 4 for BRPO: Batch Residual Policy Optimization
Viaarxiv icon