Cosmin Paduraru

Active Offline Policy Selection

Jun 18, 2021
Ksenia Konyushkova, Yutian Chen, Thomas Paine, Caglar Gulcehre, Cosmin Paduraru, Daniel J Mankowitz, Misha Denil, Nando de Freitas

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization

Apr 28, 2021
Michael R. Zhang, Tom Le Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, Ziyu Wang, Mohammad Norouzi

* ICLR 2021. 17 pages 

Benchmarks for Deep Off-Policy Evaluation

Mar 30, 2021
Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine

* ICLR 2021 paper. Policies and evaluation code are available at 

Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification

Oct 20, 2020
Daniel J. Mankowitz, Dan A. Calian, Rae Jeong, Cosmin Paduraru, Nicolas Heess, Sumanth Dathathri, Martin Riedmiller, Timothy Mann

Hyperparameter Selection for Offline Reinforcement Learning

Jul 17, 2020
Tom Le Paine, Cosmin Paduraru, Andrea Michi, Caglar Gulcehre, Konrad Zolna, Alexander Novikov, Ziyu Wang, Nando de Freitas

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

* 21 pages including supplementary material, the github link for the datasets: 

An empirical investigation of the challenges of real-world reinforcement learning

Mar 24, 2020
Gabriel Dulac-Arnold, Nir Levine, Daniel J. Mankowitz, Jerry Li, Cosmin Paduraru, Sven Gowal, Todd Hester

* arXiv admin note: text overlap with arXiv:1904.12901 

Safe Exploration in Continuous Action Spaces

Jan 26, 2018
Gal Dalal, Krishnamurthy Dvijotham, Matej Vecerik, Todd Hester, Cosmin Paduraru, Yuval Tassa
