Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Caglar Gulcehre

Active Offline Policy Selection


Jun 18, 2021
Ksenia Konyushkova, Yutian Chen, Thomas Paine, Caglar Gulcehre, Cosmin Paduraru, Daniel J Mankowitz, Misha Denil, Nando de Freitas


  Access Paper or Ask Questions

On Instrumental Variable Regression for Deep Offline Policy Evaluation


May 21, 2021
Yutian Chen, Liyuan Xu, Caglar Gulcehre, Tom Le Paine, Arthur Gretton, Nando de Freitas, Arnaud Doucet


  Access Paper or Ask Questions

Regularized Behavior Value Estimation


Mar 17, 2021
Caglar Gulcehre, Sergio GĂłmez Colmenarejo, Ziyu Wang, Jakub Sygnowski, Thomas Paine, Konrad Zolna, Yutian Chen, Matthew Hoffman, Razvan Pascanu, Nando de Freitas


  Access Paper or Ask Questions

Offline Learning from Demonstrations and Unlabeled Experience


Nov 27, 2020
Konrad Zolna, Alexander Novikov, Ksenia Konyushkova, Caglar Gulcehre, Ziyu Wang, Yusuf Aytar, Misha Denil, Nando de Freitas, Scott Reed

* Accepted to Offline Reinforcement Learning Workshop at Neural Information Processing Systems (2020) 

  Access Paper or Ask Questions

Hyperparameter Selection for Offline Reinforcement Learning


Jul 17, 2020
Tom Le Paine, Cosmin Paduraru, Andrea Michi, Caglar Gulcehre, Konrad Zolna, Alexander Novikov, Ziyu Wang, Nando de Freitas


  Access Paper or Ask Questions

RL Unplugged: Benchmarks for Offline Reinforcement Learning


Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

* 21 pages including supplementary material, the github link for the datasets: https://github.com/deepmind/deepmind-research/rl_unplugged 

  Access Paper or Ask Questions

Critic Regularized Regression


Jun 26, 2020
Ziyu Wang, Alexander Novikov, Konrad Żołna, Jost Tobias Springenberg, Scott Reed, Bobak Shahriari, Noah Siegel, Josh Merel, Caglar Gulcehre, Nicolas Heess, Nando de Freitas

* 23 pages 

  Access Paper or Ask Questions

Acme: A Research Framework for Distributed Reinforcement Learning


Jun 01, 2020
Matt Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, Kate Baumli, Sarah Henderson, Alex Novikov, Sergio GĂłmez Colmenarejo, Serkan Cabi, Caglar Gulcehre, Tom Le Paine, Andrew Cowie, Ziyu Wang, Bilal Piot, Nando de Freitas


  Access Paper or Ask Questions

Improving the Gating Mechanism of Recurrent Neural Networks


Oct 22, 2019
Albert Gu, Caglar Gulcehre, Tom Le Paine, Matt Hoffman, Razvan Pascanu


  Access Paper or Ask Questions

Stabilizing Transformers for Reinforcement Learning


Oct 13, 2019
Emilio Parisotto, H. Francis Song, Jack W. Rae, Razvan Pascanu, Caglar Gulcehre, Siddhant M. Jayakumar, Max Jaderberg, Raphael Lopez Kaufman, Aidan Clark, Seb Noury, Matthew M. Botvinick, Nicolas Heess, Raia Hadsell


  Access Paper or Ask Questions

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems


Sep 03, 2019
Tom Le Paine, Caglar Gulcehre, Bobak Shahriari, Misha Denil, Matt Hoffman, Hubert Soyer, Richard Tanburn, Steven Kapturowski, Neil Rabinowitz, Duncan Williams, Gabriel Barth-Maron, Ziyu Wang, Nando de Freitas, Worlds Team


  Access Paper or Ask Questions

Intrinsic Social Motivation via Causal Influence in Multi-Agent RL


Oct 19, 2018
Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas


  Access Paper or Ask Questions

Relational inductive biases, deep learning, and graph networks


Oct 17, 2018
Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, Yujia Li, Razvan Pascanu


  Access Paper or Ask Questions

Sample Efficient Adaptive Text-to-Speech


Sep 27, 2018
Yutian Chen, Yannis Assael, Brendan Shillingford, David Budden, Scott Reed, Heiga Zen, Quan Wang, Luis C. Cobo, Andrew Trask, Ben Laurie, Caglar Gulcehre, Aäron van den Oord, Oriol Vinyals, Nando de Freitas


  Access Paper or Ask Questions

Hyperbolic Attention Networks


May 24, 2018
Caglar Gulcehre, Misha Denil, Mateusz Malinowski, Ali Razavi, Razvan Pascanu, Karl Moritz Hermann, Peter Battaglia, Victor Bapst, David Raposo, Adam Santoro, Nando de Freitas


  Access Paper or Ask Questions

Plan, Attend, Generate: Planning for Sequence-to-Sequence Models


Nov 28, 2017
Francis Dutil, Caglar Gulcehre, Adam Trischler, Yoshua Bengio

* NIPS 2017 

  Access Paper or Ask Questions

Gated Orthogonal Recurrent Units: On Learning to Forget


Oct 25, 2017
Li Jing, Caglar Gulcehre, John Peurifoy, Yichen Shen, Max Tegmark, Marin Soljačić, Yoshua Bengio


  Access Paper or Ask Questions

Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder


Jun 23, 2017
Caglar Gulcehre, Francis Dutil, Adam Trischler, Yoshua Bengio

* Accepted to Rep4NLP 2017 Workshop at ACL 2017 Conference 

  Access Paper or Ask Questions

Machine Comprehension by Text-to-Text Neural Question Generation


May 15, 2017
Xingdi Yuan, Tong Wang, Caglar Gulcehre, Alessandro Sordoni, Philip Bachman, Sandeep Subramanian, Saizheng Zhang, Adam Trischler


  Access Paper or Ask Questions

Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes


Mar 17, 2017
Caglar Gulcehre, Sarath Chandar, Kyunghyun Cho, Yoshua Bengio

* 13 pages, 3 figures 

  Access Paper or Ask Questions

A Robust Adaptive Stochastic Gradient Method for Deep Learning


Mar 02, 2017
Caglar Gulcehre, Jose Sotelo, Marcin Moczulski, Yoshua Bengio

* IJCNN 2017 Accepted Paper, An extension of our paper, "ADASECANT: Robust Adaptive Secant Method for Stochastic Gradient" 

  Access Paper or Ask Questions

Memory Augmented Neural Networks with Wormhole Connections


Jan 30, 2017
Caglar Gulcehre, Sarath Chandar, Yoshua Bengio


  Access Paper or Ask Questions

Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond


Aug 26, 2016
Ramesh Nallapati, Bowen Zhou, Cicero Nogueira dos santos, Caglar Gulcehre, Bing Xiang

* The SIGNLL Conference on Computational Natural Language Learning (CoNLL), 2016 

  Access Paper or Ask Questions

Pointing the Unknown Words


Aug 21, 2016
Caglar Gulcehre, Sungjin Ahn, Ramesh Nallapati, Bowen Zhou, Yoshua Bengio

* ACL 2016 Oral Paper 

  Access Paper or Ask Questions

Mollifying Networks


Aug 17, 2016
Caglar Gulcehre, Marcin Moczulski, Francesco Visin, Yoshua Bengio


  Access Paper or Ask Questions

Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus


May 29, 2016
Iulian Vlad Serban, Alberto García-Durán, Caglar Gulcehre, Sungjin Ahn, Sarath Chandar, Aaron Courville, Yoshua Bengio

* 13 pages, 1 figure, 7 tables 

  Access Paper or Ask Questions

Theano: A Python framework for fast computation of mathematical expressions


May 09, 2016
The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Mélanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian Goodfellow, Matt Graham, Caglar Gulcehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrancois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert T. McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang

* 19 pages, 5 figures 

  Access Paper or Ask Questions

Noisy Activation Functions


Apr 03, 2016
Caglar Gulcehre, Marcin Moczulski, Misha Denil, Yoshua Bengio


  Access Paper or Ask Questions

Policy Distillation


Jan 07, 2016
Andrei A. Rusu, Sergio Gomez Colmenarejo, Caglar Gulcehre, Guillaume Desjardins, James Kirkpatrick, Razvan Pascanu, Volodymyr Mnih, Koray Kavukcuoglu, Raia Hadsell

* Submitted to ICLR 2016 

  Access Paper or Ask Questions