Ofir Nachum

Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization

Jun 23, 2020
Tatsuya Matsushima, Hiroki Furuta, Yutaka Matsuo, Ofir Nachum, Shixiang Gu

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

Apr 20, 2020
Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine

Datasets for Data-Driven Reinforcement Learning

Apr 15, 2020
Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine

BRPO: Batch Residual Policy Optimization

Feb 08, 2020
Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier

Reinforcement Learning via Fenchel-Rockafellar Duality

Jan 09, 2020
Ofir Nachum, Bo Dai

Imitation Learning via Off-Policy Distribution Matching

Dec 10, 2019
Ilya Kostrikov, Ofir Nachum, Jonathan Tompson

AlgaeDICE: Policy Gradient from Arbitrary Experience

Dec 04, 2019
Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, Dale Schuurmans

Behavior Regularized Offline Reinforcement Learning

Nov 26, 2019
Yifan Wu, George Tucker, Ofir Nachum
