Alert button
Picture for Nando de Freitas

Nando de Freitas

Alert button

Genie: Generative Interactive Environments

Add code
Bookmark button
Alert button
Feb 23, 2024
Jake Bruce, Michael Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal Behbahani, Stephanie Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel

Viaarxiv icon

Reinforced Self-Training (ReST) for Language Modeling

Add code
Bookmark button
Alert button
Aug 21, 2023
Caglar Gulcehre, Tom Le Paine, Srivatsan Srinivasan, Ksenia Konyushkova, Lotte Weerts, Abhishek Sharma, Aditya Siddhant, Alex Ahern, Miaosen Wang, Chenjie Gu, Wolfgang Macherey, Arnaud Doucet, Orhan Firat, Nando de Freitas

Figure 1 for Reinforced Self-Training (ReST) for Language Modeling
Figure 2 for Reinforced Self-Training (ReST) for Language Modeling
Figure 3 for Reinforced Self-Training (ReST) for Language Modeling
Figure 4 for Reinforced Self-Training (ReST) for Language Modeling
Viaarxiv icon

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 07, 2023
Michaël Mathieu, Sherjil Ozair, Srivatsan Srinivasan, Caglar Gulcehre, Shangtong Zhang, Ray Jiang, Tom Le Paine, Richard Powell, Konrad Żołna, Julian Schrittwieser, David Choi, Petko Georgiev, Daniel Toyama, Aja Huang, Roman Ring, Igor Babuschkin, Timo Ewalds, Mahyar Bordbar, Sarah Henderson, Sergio Gómez Colmenarejo, Aäron van den Oord, Wojciech Marian Czarnecki, Nando de Freitas, Oriol Vinyals

Figure 1 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 2 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 3 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 4 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Viaarxiv icon

Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning

Add code
Bookmark button
Alert button
May 09, 2023
Patrick Emedom-Nnamdi, Abram L. Friesen, Bobak Shahriari, Nando de Freitas, Matt W. Hoffman

Figure 1 for Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning
Figure 2 for Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning
Figure 3 for Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning
Figure 4 for Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning
Viaarxiv icon

Vision-Language Models as Success Detectors

Add code
Bookmark button
Alert button
Mar 13, 2023
Yuqing Du, Ksenia Konyushkova, Misha Denil, Akhil Raju, Jessica Landon, Felix Hill, Nando de Freitas, Serkan Cabi

Figure 1 for Vision-Language Models as Success Detectors
Figure 2 for Vision-Language Models as Success Detectors
Figure 3 for Vision-Language Models as Success Detectors
Figure 4 for Vision-Language Models as Success Detectors
Viaarxiv icon

Towards Learning Universal Hyperparameter Optimizers with Transformers

Add code
Bookmark button
Alert button
May 26, 2022
Yutian Chen, Xingyou Song, Chansoo Lee, Zi Wang, Qiuyi Zhang, David Dohan, Kazuya Kawakami, Greg Kochanski, Arnaud Doucet, Marc'aurelio Ranzato, Sagi Perel, Nando de Freitas

Figure 1 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 2 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 3 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 4 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Viaarxiv icon

A Generalist Agent

Add code
Bookmark button
Alert button
May 19, 2022
Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, Tom Eccles, Jake Bruce, Ali Razavi, Ashley Edwards, Nicolas Heess, Yutian Chen, Raia Hadsell, Oriol Vinyals, Mahyar Bordbar, Nando de Freitas

Figure 1 for A Generalist Agent
Figure 2 for A Generalist Agent
Figure 3 for A Generalist Agent
Figure 4 for A Generalist Agent
Viaarxiv icon

Shaking the foundations: delusions in sequence models for interaction and control

Add code
Bookmark button
Alert button
Oct 20, 2021
Pedro A. Ortega, Markus Kunesch, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Joel Veness, Jonas Buchli, Jonas Degrave, Bilal Piot, Julien Perolat, Tom Everitt, Corentin Tallec, Emilio Parisotto, Tom Erez, Yutian Chen, Scott Reed, Marcus Hutter, Nando de Freitas, Shane Legg

Figure 1 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 2 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 3 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 4 for Shaking the foundations: delusions in sequence models for interaction and control
Viaarxiv icon