Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

PI-ARS: Accelerating Evolution-Learned Visual-Locomotion with Predictive Information Representations


Jul 27, 2022
Kuang-Huei Lee, Ofir Nachum, Tingnan Zhang, Sergio Guadarrama, Jie Tan, Wenhao Yu

* To appear at IROS 2022. The supplementary video is available at https://kuanghuei.github.io/piars 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Joint Representation Training in Sequential Tasks with Shared Structure


Jun 24, 2022
Aldo Pacchiano, Ofir Nachum, Nilseh Tripuraneni, Peter Bartlett


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

A Mixture-of-Expert Approach to RL-based Dialogue Management


May 31, 2022
Yinlam Chow, Aza Tulepbergenov, Ofir Nachum, MoonKyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Multi-Game Decision Transformers


May 30, 2022
Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski, Igor Mordatch


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters


May 27, 2022
Seyed Kamyar Seyed Ghasemipour, Shixiang Shane Gu, Ofir Nachum

* Our codebase can be found at https://github.com/google-research/google-research/tree/master/jrl 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Chain of Thought Imitation with Procedure Cloning


May 22, 2022
Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error


Jan 28, 2022
Scott Fujimoto, David Meger, Doina Precup, Ofir Nachum, Shixiang Shane Gu


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Model Selection in Batch Policy Optimization


Dec 23, 2021
Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions


Nov 29, 2021
Bogdan Mazoure, Ilya Kostrikov, Ofir Nachum, Jonathan Tompson

* Offline RL workshop at NeurIPS 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

TRAIL: Near-Optimal Imitation Learning with Suboptimal Data


Oct 27, 2021
Mengjiao Yang, Sergey Levine, Ofir Nachum


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
4
5
6
>>