Alert button
Picture for Alistair Muldal

Alistair Muldal

Alert button

Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
Nov 21, 2022
Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Jirka Lhotka, Timothy Lillicrap, Alistair Muldal, George Powell, Adam Santoro, Guy Scully, Sanjana Srivastava, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 2 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 3 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 4 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Viaarxiv icon

Intra-agent speech permits zero-shot task acquisition

Add code
Bookmark button
Alert button
Jun 07, 2022
Chen Yan, Federico Carnevale, Petko Georgiev, Adam Santoro, Aurelia Guy, Alistair Muldal, Chia-Chun Hung, Josh Abramson, Timothy Lillicrap, Gregory Wayne

Figure 1 for Intra-agent speech permits zero-shot task acquisition
Figure 2 for Intra-agent speech permits zero-shot task acquisition
Figure 3 for Intra-agent speech permits zero-shot task acquisition
Figure 4 for Intra-agent speech permits zero-shot task acquisition
Viaarxiv icon

Evaluating Multimodal Interactive Agents

Add code
Bookmark button
Alert button
May 26, 2022
Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Timothy Lillicrap, Alistair Muldal, Blake Richards, Adam Santoro, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan

Figure 1 for Evaluating Multimodal Interactive Agents
Figure 2 for Evaluating Multimodal Interactive Agents
Figure 3 for Evaluating Multimodal Interactive Agents
Figure 4 for Evaluating Multimodal Interactive Agents
Viaarxiv icon

A data-driven approach for learning to control computers

Add code
Bookmark button
Alert button
Feb 16, 2022
Peter C Humphreys, David Raposo, Toby Pohlen, Gregory Thornton, Rachita Chhaparia, Alistair Muldal, Josh Abramson, Petko Georgiev, Alex Goldin, Adam Santoro, Timothy Lillicrap

Figure 1 for A data-driven approach for learning to control computers
Figure 2 for A data-driven approach for learning to control computers
Figure 3 for A data-driven approach for learning to control computers
Figure 4 for A data-driven approach for learning to control computers
Viaarxiv icon

Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning

Add code
Bookmark button
Alert button
Dec 07, 2021
DeepMind Interactive Agents Team, Josh Abramson, Arun Ahuja, Arthur Brussee, Federico Carnevale, Mary Cassin, Felix Fischer, Petko Georgiev, Alex Goldin, Tim Harley, Felix Hill, Peter C Humphreys, Alden Hung, Jessica Landon, Timothy Lillicrap, Hamza Merzic, Alistair Muldal, Adam Santoro, Guy Scully, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 2 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 3 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 4 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Viaarxiv icon

Imitating Interactive Intelligence

Add code
Bookmark button
Alert button
Jan 21, 2021
Josh Abramson, Arun Ahuja, Iain Barr, Arthur Brussee, Federico Carnevale, Mary Cassin, Rachita Chhaparia, Stephen Clark, Bogdan Damoc, Andrew Dudzik, Petko Georgiev, Aurelia Guy, Tim Harley, Felix Hill, Alden Hung, Zachary Kenton, Jessica Landon, Timothy Lillicrap, Kory Mathewson, Soňa Mokrá, Alistair Muldal, Adam Santoro, Nikolay Savinov, Vikrant Varma, Greg Wayne, Duncan Williams, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Imitating Interactive Intelligence
Figure 2 for Imitating Interactive Intelligence
Figure 3 for Imitating Interactive Intelligence
Figure 4 for Imitating Interactive Intelligence
Viaarxiv icon

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 11, 2020
Mehdi Mirza, Andrew Jaegle, Jonathan J. Hunt, Arthur Guez, Saran Tunyasuvunakool, Alistair Muldal, Théophane Weber, Peter Karkus, Sébastien Racanière, Lars Buesing, Timothy Lillicrap, Nicolas Heess

Figure 1 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 2 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 3 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 4 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Viaarxiv icon

dm_control: Software and Tasks for Continuous Control

Add code
Bookmark button
Alert button
Jun 22, 2020
Yuval Tassa, Saran Tunyasuvunakool, Alistair Muldal, Yotam Doron, Siqi Liu, Steven Bohez, Josh Merel, Tom Erez, Timothy Lillicrap, Nicolas Heess

Figure 1 for dm_control: Software and Tasks for Continuous Control
Figure 2 for dm_control: Software and Tasks for Continuous Control
Figure 3 for dm_control: Software and Tasks for Continuous Control
Figure 4 for dm_control: Software and Tasks for Continuous Control
Viaarxiv icon

Distributed Distributional Deterministic Policy Gradients

Add code
Bookmark button
Alert button
Apr 23, 2018
Gabriel Barth-Maron, Matthew W. Hoffman, David Budden, Will Dabney, Dan Horgan, Dhruva TB, Alistair Muldal, Nicolas Heess, Timothy Lillicrap

Figure 1 for Distributed Distributional Deterministic Policy Gradients
Figure 2 for Distributed Distributional Deterministic Policy Gradients
Figure 3 for Distributed Distributional Deterministic Policy Gradients
Figure 4 for Distributed Distributional Deterministic Policy Gradients
Viaarxiv icon