Alert button
Picture for Greg Wayne

Greg Wayne

Alert button

Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
Nov 21, 2022
Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Jirka Lhotka, Timothy Lillicrap, Alistair Muldal, George Powell, Adam Santoro, Guy Scully, Sanjana Srivastava, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 2 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 3 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 4 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Viaarxiv icon

Evaluating Multimodal Interactive Agents

Add code
Bookmark button
Alert button
May 26, 2022
Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Timothy Lillicrap, Alistair Muldal, Blake Richards, Adam Santoro, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan

Figure 1 for Evaluating Multimodal Interactive Agents
Figure 2 for Evaluating Multimodal Interactive Agents
Figure 3 for Evaluating Multimodal Interactive Agents
Figure 4 for Evaluating Multimodal Interactive Agents
Viaarxiv icon

Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning

Add code
Bookmark button
Alert button
Dec 07, 2021
DeepMind Interactive Agents Team, Josh Abramson, Arun Ahuja, Arthur Brussee, Federico Carnevale, Mary Cassin, Felix Fischer, Petko Georgiev, Alex Goldin, Tim Harley, Felix Hill, Peter C Humphreys, Alden Hung, Jessica Landon, Timothy Lillicrap, Hamza Merzic, Alistair Muldal, Adam Santoro, Guy Scully, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 2 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 3 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 4 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Viaarxiv icon

Imitation by Predicting Observations

Add code
Bookmark button
Alert button
Jul 08, 2021
Andrew Jaegle, Yury Sulsky, Arun Ahuja, Jake Bruce, Rob Fergus, Greg Wayne

Figure 1 for Imitation by Predicting Observations
Figure 2 for Imitation by Predicting Observations
Figure 3 for Imitation by Predicting Observations
Figure 4 for Imitation by Predicting Observations
Viaarxiv icon

Synthetic Returns for Long-Term Credit Assignment

Add code
Bookmark button
Alert button
Feb 24, 2021
David Raposo, Sam Ritter, Adam Santoro, Greg Wayne, Theophane Weber, Matt Botvinick, Hado van Hasselt, Francis Song

Figure 1 for Synthetic Returns for Long-Term Credit Assignment
Figure 2 for Synthetic Returns for Long-Term Credit Assignment
Figure 3 for Synthetic Returns for Long-Term Credit Assignment
Figure 4 for Synthetic Returns for Long-Term Credit Assignment
Viaarxiv icon

Imitating Interactive Intelligence

Add code
Bookmark button
Alert button
Jan 21, 2021
Josh Abramson, Arun Ahuja, Iain Barr, Arthur Brussee, Federico Carnevale, Mary Cassin, Rachita Chhaparia, Stephen Clark, Bogdan Damoc, Andrew Dudzik, Petko Georgiev, Aurelia Guy, Tim Harley, Felix Hill, Alden Hung, Zachary Kenton, Jessica Landon, Timothy Lillicrap, Kory Mathewson, Soňa Mokrá, Alistair Muldal, Adam Santoro, Nikolay Savinov, Vikrant Varma, Greg Wayne, Duncan Williams, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Imitating Interactive Intelligence
Figure 2 for Imitating Interactive Intelligence
Figure 3 for Imitating Interactive Intelligence
Figure 4 for Imitating Interactive Intelligence
Viaarxiv icon

Gaussian Gated Linear Networks

Add code
Bookmark button
Alert button
Jun 10, 2020
David Budden, Adam Marblestone, Eren Sezener, Tor Lattimore, Greg Wayne, Joel Veness

Figure 1 for Gaussian Gated Linear Networks
Figure 2 for Gaussian Gated Linear Networks
Figure 3 for Gaussian Gated Linear Networks
Figure 4 for Gaussian Gated Linear Networks
Viaarxiv icon

Product Kanerva Machines: Factorized Bayesian Memory

Add code
Bookmark button
Alert button
Feb 06, 2020
Adam Marblestone, Yan Wu, Greg Wayne

Figure 1 for Product Kanerva Machines: Factorized Bayesian Memory
Figure 2 for Product Kanerva Machines: Factorized Bayesian Memory
Figure 3 for Product Kanerva Machines: Factorized Bayesian Memory
Figure 4 for Product Kanerva Machines: Factorized Bayesian Memory
Viaarxiv icon

Hindsight Credit Assignment

Add code
Bookmark button
Alert button
Dec 05, 2019
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Remi Munos

Figure 1 for Hindsight Credit Assignment
Figure 2 for Hindsight Credit Assignment
Figure 3 for Hindsight Credit Assignment
Figure 4 for Hindsight Credit Assignment
Viaarxiv icon