Alert button
Picture for Sergio Gómez Colmenarejo

Sergio Gómez Colmenarejo

Alert button

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 07, 2023
Michaël Mathieu, Sherjil Ozair, Srivatsan Srinivasan, Caglar Gulcehre, Shangtong Zhang, Ray Jiang, Tom Le Paine, Richard Powell, Konrad Żołna, Julian Schrittwieser, David Choi, Petko Georgiev, Daniel Toyama, Aja Huang, Roman Ring, Igor Babuschkin, Timo Ewalds, Mahyar Bordbar, Sarah Henderson, Sergio Gómez Colmenarejo, Aäron van den Oord, Wojciech Marian Czarnecki, Nando de Freitas, Oriol Vinyals

Figure 1 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 2 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 3 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 4 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Viaarxiv icon

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation

Add code
Bookmark button
Alert button
Jun 20, 2023
Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz, Abbas Abdolmaleki, Oliver Groth, Jean-Baptiste Regli, Oleg Sushkov, Tom Rothörl, José Enrique Chen, Yusuf Aytar, Dave Barker, Joy Ortiz, Martin Riedmiller, Jost Tobias Springenberg, Raia Hadsell, Francesco Nori, Nicolas Heess

Viaarxiv icon

Regularized Behavior Value Estimation

Add code
Bookmark button
Alert button
Mar 17, 2021
Caglar Gulcehre, Sergio Gómez Colmenarejo, Ziyu Wang, Jakub Sygnowski, Thomas Paine, Konrad Zolna, Yutian Chen, Matthew Hoffman, Razvan Pascanu, Nando de Freitas

Figure 1 for Regularized Behavior Value Estimation
Figure 2 for Regularized Behavior Value Estimation
Figure 3 for Regularized Behavior Value Estimation
Figure 4 for Regularized Behavior Value Estimation
Viaarxiv icon

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 24, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gómez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

Figure 1 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 2 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 3 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 4 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Viaarxiv icon

Acme: A Research Framework for Distributed Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 01, 2020
Matt Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, Kate Baumli, Sarah Henderson, Alex Novikov, Sergio Gómez Colmenarejo, Serkan Cabi, Caglar Gulcehre, Tom Le Paine, Andrew Cowie, Ziyu Wang, Bilal Piot, Nando de Freitas

Figure 1 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 2 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 3 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 4 for Acme: A Research Framework for Distributed Reinforcement Learning
Viaarxiv icon

A Framework for Data-Driven Robotics

Add code
Bookmark button
Alert button
Sep 26, 2019
Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott Reed, Rae Jeong, Konrad Żołna, Yusuf Aytar, David Budden, Mel Vecerik, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang

Figure 1 for A Framework for Data-Driven Robotics
Figure 2 for A Framework for Data-Driven Robotics
Figure 3 for A Framework for Data-Driven Robotics
Figure 4 for A Framework for Data-Driven Robotics
Viaarxiv icon

TF-Replicator: Distributed Machine Learning for Researchers

Add code
Bookmark button
Alert button
Feb 01, 2019
Peter Buchlovsky, David Budden, Dominik Grewe, Chris Jones, John Aslanides, Frederic Besse, Andy Brock, Aidan Clark, Sergio Gómez Colmenarejo, Aedan Pope, Fabio Viola, Dan Belov

Figure 1 for TF-Replicator: Distributed Machine Learning for Researchers
Figure 2 for TF-Replicator: Distributed Machine Learning for Researchers
Figure 3 for TF-Replicator: Distributed Machine Learning for Researchers
Figure 4 for TF-Replicator: Distributed Machine Learning for Researchers
Viaarxiv icon

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL

Add code
Bookmark button
Alert button
Oct 11, 2018
Tom Le Paine, Sergio Gómez Colmenarejo, Ziyu Wang, Scott Reed, Yusuf Aytar, Tobias Pfaff, Matt W. Hoffman, Gabriel Barth-Maron, Serkan Cabi, David Budden, Nando de Freitas

Figure 1 for One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Figure 2 for One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Figure 3 for One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Figure 4 for One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Viaarxiv icon

Learning Awareness Models

Add code
Bookmark button
Alert button
Apr 17, 2018
Brandon Amos, Laurent Dinh, Serkan Cabi, Thomas Rothörl, Sergio Gómez Colmenarejo, Alistair Muldal, Tom Erez, Yuval Tassa, Nando de Freitas, Misha Denil

Figure 1 for Learning Awareness Models
Figure 2 for Learning Awareness Models
Figure 3 for Learning Awareness Models
Figure 4 for Learning Awareness Models
Viaarxiv icon

The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously

Add code
Bookmark button
Alert button
Jul 11, 2017
Serkan Cabi, Sergio Gómez Colmenarejo, Matthew W. Hoffman, Misha Denil, Ziyu Wang, Nando de Freitas

Figure 1 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 2 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 3 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Figure 4 for The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Viaarxiv icon