Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes


Nov 28, 2022
Aviral Kumar, Rishabh Agarwal, Xinyang Geng, George Tucker, Sergey Levine

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Oracle Inequalities for Model Selection in Offline Reinforcement Learning


Nov 03, 2022
Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai, Emma Brunskill

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Model Selection in Batch Policy Optimization


Dec 23, 2021
Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization


Dec 09, 2021
Aviral Kumar, Rishabh Agarwal, Tengyu Ma, Aaron Courville, George Tucker, Sergey Levine

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Coupled Gradient Estimators for Discrete Latent Variables


Jun 15, 2021
Zhe Dong, Andriy Mnih, George Tucker

Add code

* Under Review 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization


Apr 28, 2021
Michael R. Zhang, Tom Le Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, Ziyu Wang, Mohammad Norouzi

Add code

* ICLR 2021. 17 pages 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Benchmarks for Deep Off-Policy Evaluation


Mar 30, 2021
Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine

Add code

* ICLR 2021 paper. Policies and evaluation code are available at https://github.com/google-research/deep_ope 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Offline Policy Selection under Uncertainty


Dec 12, 2020
Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

RL Unplugged: Benchmarks for Offline Reinforcement Learning


Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

Add code

* 21 pages including supplementary material, the github link for the datasets: https://github.com/deepmind/deepmind-research/rl_unplugged 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Conservative Q-Learning for Offline Reinforcement Learning


Jun 29, 2020
Aviral Kumar, Aurick Zhou, George Tucker, Sergey Levine

Add code

* Preprint. Website at: https://sites.google.com/view/cql-offline-rl 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
4
5
>>