Alert button
Picture for Caglar Gulcehre

Caglar Gulcehre

Alert button

On Instrumental Variable Regression for Deep Offline Policy Evaluation

Add code
Bookmark button
Alert button
May 21, 2021
Yutian Chen, Liyuan Xu, Caglar Gulcehre, Tom Le Paine, Arthur Gretton, Nando de Freitas, Arnaud Doucet

Figure 1 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 2 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 3 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 4 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Viaarxiv icon

Regularized Behavior Value Estimation

Add code
Bookmark button
Alert button
Mar 17, 2021
Caglar Gulcehre, Sergio Gómez Colmenarejo, Ziyu Wang, Jakub Sygnowski, Thomas Paine, Konrad Zolna, Yutian Chen, Matthew Hoffman, Razvan Pascanu, Nando de Freitas

Figure 1 for Regularized Behavior Value Estimation
Figure 2 for Regularized Behavior Value Estimation
Figure 3 for Regularized Behavior Value Estimation
Figure 4 for Regularized Behavior Value Estimation
Viaarxiv icon

Offline Learning from Demonstrations and Unlabeled Experience

Add code
Bookmark button
Alert button
Nov 27, 2020
Konrad Zolna, Alexander Novikov, Ksenia Konyushkova, Caglar Gulcehre, Ziyu Wang, Yusuf Aytar, Misha Denil, Nando de Freitas, Scott Reed

Figure 1 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 2 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 3 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 4 for Offline Learning from Demonstrations and Unlabeled Experience
Viaarxiv icon

Hyperparameter Selection for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 17, 2020
Tom Le Paine, Cosmin Paduraru, Andrea Michi, Caglar Gulcehre, Konrad Zolna, Alexander Novikov, Ziyu Wang, Nando de Freitas

Figure 1 for Hyperparameter Selection for Offline Reinforcement Learning
Figure 2 for Hyperparameter Selection for Offline Reinforcement Learning
Figure 3 for Hyperparameter Selection for Offline Reinforcement Learning
Figure 4 for Hyperparameter Selection for Offline Reinforcement Learning
Viaarxiv icon

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

Figure 1 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 2 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 3 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 4 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Viaarxiv icon

Critic Regularized Regression

Add code
Bookmark button
Alert button
Jun 26, 2020
Ziyu Wang, Alexander Novikov, Konrad Żołna, Jost Tobias Springenberg, Scott Reed, Bobak Shahriari, Noah Siegel, Josh Merel, Caglar Gulcehre, Nicolas Heess, Nando de Freitas

Figure 1 for Critic Regularized Regression
Figure 2 for Critic Regularized Regression
Figure 3 for Critic Regularized Regression
Figure 4 for Critic Regularized Regression
Viaarxiv icon

Acme: A Research Framework for Distributed Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 01, 2020
Matt Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, Kate Baumli, Sarah Henderson, Alex Novikov, Sergio Gómez Colmenarejo, Serkan Cabi, Caglar Gulcehre, Tom Le Paine, Andrew Cowie, Ziyu Wang, Bilal Piot, Nando de Freitas

Figure 1 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 2 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 3 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 4 for Acme: A Research Framework for Distributed Reinforcement Learning
Viaarxiv icon

Improving the Gating Mechanism of Recurrent Neural Networks

Add code
Bookmark button
Alert button
Oct 22, 2019
Albert Gu, Caglar Gulcehre, Tom Le Paine, Matt Hoffman, Razvan Pascanu

Figure 1 for Improving the Gating Mechanism of Recurrent Neural Networks
Figure 2 for Improving the Gating Mechanism of Recurrent Neural Networks
Figure 3 for Improving the Gating Mechanism of Recurrent Neural Networks
Figure 4 for Improving the Gating Mechanism of Recurrent Neural Networks
Viaarxiv icon

Stabilizing Transformers for Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 13, 2019
Emilio Parisotto, H. Francis Song, Jack W. Rae, Razvan Pascanu, Caglar Gulcehre, Siddhant M. Jayakumar, Max Jaderberg, Raphael Lopez Kaufman, Aidan Clark, Seb Noury, Matthew M. Botvinick, Nicolas Heess, Raia Hadsell

Figure 1 for Stabilizing Transformers for Reinforcement Learning
Figure 2 for Stabilizing Transformers for Reinforcement Learning
Figure 3 for Stabilizing Transformers for Reinforcement Learning
Figure 4 for Stabilizing Transformers for Reinforcement Learning
Viaarxiv icon