Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Tom Le Paine

On Instrumental Variable Regression for Deep Offline Policy Evaluation


May 21, 2021
Yutian Chen, Liyuan Xu, Caglar Gulcehre, Tom Le Paine, Arthur Gretton, Nando de Freitas, Arnaud Doucet


  Access Paper or Ask Questions

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization


Apr 28, 2021
Michael R. Zhang, Tom Le Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, Ziyu Wang, Mohammad Norouzi

* ICLR 2021. 17 pages 

  Access Paper or Ask Questions

Benchmarks for Deep Off-Policy Evaluation


Mar 30, 2021
Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine

* ICLR 2021 paper. Policies and evaluation code are available at https://github.com/google-research/deep_ope 

  Access Paper or Ask Questions

Hyperparameter Selection for Offline Reinforcement Learning


Jul 17, 2020
Tom Le Paine, Cosmin Paduraru, Andrea Michi, Caglar Gulcehre, Konrad Zolna, Alexander Novikov, Ziyu Wang, Nando de Freitas


  Access Paper or Ask Questions

RL Unplugged: Benchmarks for Offline Reinforcement Learning


Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

* 21 pages including supplementary material, the github link for the datasets: https://github.com/deepmind/deepmind-research/rl_unplugged 

  Access Paper or Ask Questions

Acme: A Research Framework for Distributed Reinforcement Learning


Jun 01, 2020
Matt Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, Kate Baumli, Sarah Henderson, Alex Novikov, Sergio G贸mez Colmenarejo, Serkan Cabi, Caglar Gulcehre, Tom Le Paine, Andrew Cowie, Ziyu Wang, Bilal Piot, Nando de Freitas


  Access Paper or Ask Questions

Improving the Gating Mechanism of Recurrent Neural Networks


Oct 22, 2019
Albert Gu, Caglar Gulcehre, Tom Le Paine, Matt Hoffman, Razvan Pascanu


  Access Paper or Ask Questions

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems


Sep 03, 2019
Tom Le Paine, Caglar Gulcehre, Bobak Shahriari, Misha Denil, Matt Hoffman, Hubert Soyer, Richard Tanburn, Steven Kapturowski, Neil Rabinowitz, Duncan Williams, Gabriel Barth-Maron, Ziyu Wang, Nando de Freitas, Worlds Team


  Access Paper or Ask Questions

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL


Oct 11, 2018
Tom Le Paine, Sergio G贸mez Colmenarejo, Ziyu Wang, Scott Reed, Yusuf Aytar, Tobias Pfaff, Matt W. Hoffman, Gabriel Barth-Maron, Serkan Cabi, David Budden, Nando de Freitas


  Access Paper or Ask Questions

Playing hard exploration games by watching YouTube


May 29, 2018
Yusuf Aytar, Tobias Pfaff, David Budden, Tom Le Paine, Ziyu Wang, Nando de Freitas


  Access Paper or Ask Questions

Fast Generation for Convolutional Autoregressive Models


Apr 20, 2017
Prajit Ramachandran, Tom Le Paine, Pooya Khorrami, Mohammad Babaeizadeh, Shiyu Chang, Yang Zhang, Mark A. Hasegawa-Johnson, Roy H. Campbell, Thomas S. Huang

* Accepted at ICLR 2017 Workshop 

  Access Paper or Ask Questions

Do Deep Neural Networks Learn Facial Action Units When Doing Expression Recognition?


Mar 16, 2017
Pooya Khorrami, Tom Le Paine, Thomas S. Huang

* Accepted at ICCV 2015 CV4AC Workshop. Corrected numbers in Tables 2 and 3 

  Access Paper or Ask Questions

How Deep Neural Networks Can Improve Emotion Recognition on Video Data


Jan 10, 2017
Pooya Khorrami, Tom Le Paine, Kevin Brady, Charlie Dagli, Thomas S. Huang

* Accepted at ICIP 2016. Fixed typo in Experiments section 

  Access Paper or Ask Questions

Fast Wavenet Generation Algorithm


Nov 29, 2016
Tom Le Paine, Pooya Khorrami, Shiyu Chang, Yang Zhang, Prajit Ramachandran, Mark A. Hasegawa-Johnson, Thomas S. Huang

* Technical Report 

  Access Paper or Ask Questions

Seq-NMS for Video Object Detection


Aug 22, 2016
Wei Han, Pooya Khorrami, Tom Le Paine, Prajit Ramachandran, Mohammad Babaeizadeh, Honghui Shi, Jianan Li, Shuicheng Yan, Thomas S. Huang

* Technical Report for Imagenet VID Competition 2015 

  Access Paper or Ask Questions

An Analysis of Unsupervised Pre-training in Light of Recent Advances


Apr 10, 2015
Tom Le Paine, Pooya Khorrami, Wei Han, Thomas S. Huang

* Accepted as a workshop contribution to ICLR 2015 

  Access Paper or Ask Questions