Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Towards Learning Universal Hyperparameter Optimizers with Transformers



Yutian Chen , Xingyou Song , Chansoo Lee , Zi Wang , Qiuyi Zhang , David Dohan , Kazuya Kawakami , Greg Kochanski , Arnaud Doucet , Marc'aurelio Ranzato , Sagi Perel , Nando de Freitas


   Access Paper or Ask Questions

A Generalist Agent



Scott Reed , Konrad Zolna , Emilio Parisotto , Sergio Gomez Colmenarejo , Alexander Novikov , Gabriel Barth-Maron , Mai Gimenez , Yury Sulsky , Jackie Kay , Jost Tobias Springenberg , Tom Eccles , Jake Bruce , Ali Razavi , Ashley Edwards , Nicolas Heess , Yutian Chen , Raia Hadsell , Oriol Vinyals , Mahyar Bordbar , Nando de Freitas


   Access Paper or Ask Questions

Shaking the foundations: delusions in sequence models for interaction and control



Pedro A. Ortega , Markus Kunesch , Grégoire Delétang , Tim Genewein , Jordi Grau-Moya , Joel Veness , Jonas Buchli , Jonas Degrave , Bilal Piot , Julien Perolat , Tom Everitt , Corentin Tallec , Emilio Parisotto , Tom Erez , Yutian Chen , Scott Reed , Marcus Hutter , Nando de Freitas , Shane Legg

* DeepMind Tech Report, 16 pages, 4 figures 

   Access Paper or Ask Questions

Active Offline Policy Selection



Ksenia Konyushkova , Yutian Chen , Thomas Paine , Caglar Gulcehre , Cosmin Paduraru , Daniel J Mankowitz , Misha Denil , Nando de Freitas


   Access Paper or Ask Questions

On Instrumental Variable Regression for Deep Offline Policy Evaluation



Yutian Chen , Liyuan Xu , Caglar Gulcehre , Tom Le Paine , Arthur Gretton , Nando de Freitas , Arnaud Doucet


   Access Paper or Ask Questions

Regularized Behavior Value Estimation



Caglar Gulcehre , Sergio Gómez Colmenarejo , Ziyu Wang , Jakub Sygnowski , Thomas Paine , Konrad Zolna , Yutian Chen , Matthew Hoffman , Razvan Pascanu , Nando de Freitas


   Access Paper or Ask Questions

Semi-supervised reward learning for offline reinforcement learning



Ksenia Konyushkova , Konrad Zolna , Yusuf Aytar , Alexander Novikov , Scott Reed , Serkan Cabi , Nando de Freitas

* Accepted to Offline Reinforcement Learning Workshop at Neural Information Processing Systems (2020) 

   Access Paper or Ask Questions

Offline Learning from Demonstrations and Unlabeled Experience



Konrad Zolna , Alexander Novikov , Ksenia Konyushkova , Caglar Gulcehre , Ziyu Wang , Yusuf Aytar , Misha Denil , Nando de Freitas , Scott Reed

* Accepted to Offline Reinforcement Learning Workshop at Neural Information Processing Systems (2020) 

   Access Paper or Ask Questions

Large-scale multilingual audio visual dubbing



Yi Yang , Brendan Shillingford , Yannis Assael , Miaosen Wang , Wendi Liu , Yutian Chen , Yu Zhang , Eren Sezener , Luis C. Cobo , Misha Denil , Yusuf Aytar , Nando de Freitas

* 26 pages, 8 figures 

   Access Paper or Ask Questions

1
2
3
4
5
6
7
>>