Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Offline Policy Selection under Uncertainty

Dec 12, 2020
Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans


  Access Paper or Ask Questions

OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning

Oct 27, 2020
Anurag Ajay, Aviral Kumar, Pulkit Agrawal, Sergey Levine, Ofir Nachum

* https://sites.google.com/view/opal-iclr 

  Access Paper or Ask Questions

CoinDICE: Off-Policy Confidence Interval Estimation

Oct 22, 2020
Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvári, Dale Schuurmans

* To appear at NeurIPS 2020 as spotlight 

  Access Paper or Ask Questions

Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation

Jul 27, 2020
Ilya Kostrikov, Ofir Nachum


  Access Paper or Ask Questions

Off-Policy Evaluation via the Regularized Lagrangian

Jul 07, 2020
Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans


  Access Paper or Ask Questions

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

* 21 pages including supplementary material, the github link for the datasets: https://github.com/deepmind/deepmind-research/rl_unplugged 

  Access Paper or Ask Questions

Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization

Jun 23, 2020
Tatsuya Matsushima, Hiroki Furuta, Yutaka Matsuo, Ofir Nachum, Shixiang Gu


  Access Paper or Ask Questions

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

Apr 20, 2020
Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine

* Website available at https://sites.google.com/view/d4rl/home 

  Access Paper or Ask Questions

Datasets for Data-Driven Reinforcement Learning

Apr 15, 2020
Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine


  Access Paper or Ask Questions

BRPO: Batch Residual Policy Optimization

Feb 08, 2020
Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier


  Access Paper or Ask Questions

Reinforcement Learning via Fenchel-Rockafellar Duality

Jan 09, 2020
Ofir Nachum, Bo Dai


  Access Paper or Ask Questions

Imitation Learning via Off-Policy Distribution Matching

Dec 10, 2019
Ilya Kostrikov, Ofir Nachum, Jonathan Tompson


  Access Paper or Ask Questions

AlgaeDICE: Policy Gradient from Arbitrary Experience

Dec 04, 2019
Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, Dale Schuurmans


  Access Paper or Ask Questions

Behavior Regularized Offline Reinforcement Learning

Nov 26, 2019
Yifan Wu, George Tucker, Ofir Nachum


  Access Paper or Ask Questions

Group-based Fair Learning Leads to Counter-intuitive Predictions

Oct 04, 2019
Ofir Nachum, Heinrich Jiang


  Access Paper or Ask Questions

Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?

Sep 23, 2019
Ofir Nachum, Haoran Tang, Xingyu Lu, Shixiang Gu, Honglak Lee, Sergey Levine


  Access Paper or Ask Questions

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real

Aug 13, 2019
Ofir Nachum, Michael Ahn, Hugo Ponte, Shixiang Gu, Vikash Kumar


  Access Paper or Ask Questions

DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections

Jun 10, 2019
Ofir Nachum, Yinlam Chow, Bo Dai, Lihong Li


  Access Paper or Ask Questions

DeepMDP: Learning Continuous Latent Space Models for Representation Learning

Jun 06, 2019
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare

* 13 pages main text, 16 pages appendix. ICML 2019 

  Access Paper or Ask Questions

Lyapunov-based Safe Policy Optimization for Continuous Control

Jan 28, 2019
Yinlam Chow, Ofir Nachum, Aleksandra Faust, Mohammad Ghavamzadeh, Edgar Duenez-Guzman


  Access Paper or Ask Questions

Identifying and Correcting Label Bias in Machine Learning

Jan 15, 2019
Heinrich Jiang, Ofir Nachum


  Access Paper or Ask Questions

The Laplacian in RL: Learning Representations with Efficient Approximations

Oct 10, 2018
Yifan Wu, George Tucker, Ofir Nachum


  Access Paper or Ask Questions

Data-Efficient Hierarchical Reinforcement Learning

Oct 05, 2018
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine

* NIPS 2018 

  Access Paper or Ask Questions

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning

Oct 02, 2018
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine


  Access Paper or Ask Questions

Smoothed Action Value Functions for Learning Gaussian Policies

Jul 25, 2018
Ofir Nachum, Mohammad Norouzi, George Tucker, Dale Schuurmans

* ICML 2018 

  Access Paper or Ask Questions

A Lyapunov-based Approach to Safe Reinforcement Learning

May 20, 2018
Yinlam Chow, Ofir Nachum, Edgar Duenez-Guzman, Mohammad Ghavamzadeh


  Access Paper or Ask Questions

MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks

Apr 17, 2018
Ariel Gordon, Elad Eban, Ofir Nachum, Bo Chen, Hao Wu, Tien-Ju Yang, Edward Choi

* Added reproducibility and stability figures in the appendix, as well minor typos and clarifications to the main text 

  Access Paper or Ask Questions