Alert button
Picture for Josh Abramson

Josh Abramson

Alert button

Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Nov 21, 2022
Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Jirka Lhotka, Timothy Lillicrap, Alistair Muldal, George Powell, Adam Santoro, Guy Scully, Sanjana Srivastava, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 2 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 3 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 4 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Viaarxiv icon

Intra-agent speech permits zero-shot task acquisition

Jun 07, 2022
Chen Yan, Federico Carnevale, Petko Georgiev, Adam Santoro, Aurelia Guy, Alistair Muldal, Chia-Chun Hung, Josh Abramson, Timothy Lillicrap, Gregory Wayne

Figure 1 for Intra-agent speech permits zero-shot task acquisition
Figure 2 for Intra-agent speech permits zero-shot task acquisition
Figure 3 for Intra-agent speech permits zero-shot task acquisition
Figure 4 for Intra-agent speech permits zero-shot task acquisition
Viaarxiv icon

Evaluating Multimodal Interactive Agents

May 26, 2022
Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Timothy Lillicrap, Alistair Muldal, Blake Richards, Adam Santoro, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan

Figure 1 for Evaluating Multimodal Interactive Agents
Figure 2 for Evaluating Multimodal Interactive Agents
Figure 3 for Evaluating Multimodal Interactive Agents
Figure 4 for Evaluating Multimodal Interactive Agents
Viaarxiv icon

A data-driven approach for learning to control computers

Feb 16, 2022
Peter C Humphreys, David Raposo, Toby Pohlen, Gregory Thornton, Rachita Chhaparia, Alistair Muldal, Josh Abramson, Petko Georgiev, Alex Goldin, Adam Santoro, Timothy Lillicrap

Figure 1 for A data-driven approach for learning to control computers
Figure 2 for A data-driven approach for learning to control computers
Figure 3 for A data-driven approach for learning to control computers
Figure 4 for A data-driven approach for learning to control computers
Viaarxiv icon

Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning

Dec 07, 2021
DeepMind Interactive Agents Team, Josh Abramson, Arun Ahuja, Arthur Brussee, Federico Carnevale, Mary Cassin, Felix Fischer, Petko Georgiev, Alex Goldin, Tim Harley, Felix Hill, Peter C Humphreys, Alden Hung, Jessica Landon, Timothy Lillicrap, Hamza Merzic, Alistair Muldal, Adam Santoro, Guy Scully, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 2 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 3 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 4 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Viaarxiv icon

Imitating Interactive Intelligence

Jan 21, 2021
Josh Abramson, Arun Ahuja, Iain Barr, Arthur Brussee, Federico Carnevale, Mary Cassin, Rachita Chhaparia, Stephen Clark, Bogdan Damoc, Andrew Dudzik, Petko Georgiev, Aurelia Guy, Tim Harley, Felix Hill, Alden Hung, Zachary Kenton, Jessica Landon, Timothy Lillicrap, Kory Mathewson, Soňa Mokrá, Alistair Muldal, Adam Santoro, Nikolay Savinov, Vikrant Varma, Greg Wayne, Duncan Williams, Nathaniel Wong, Chen Yan, Rui Zhu

Figure 1 for Imitating Interactive Intelligence
Figure 2 for Imitating Interactive Intelligence
Figure 3 for Imitating Interactive Intelligence
Figure 4 for Imitating Interactive Intelligence
Viaarxiv icon

Probing Emergent Semantics in Predictive Agents via Question Answering

Jun 01, 2020
Abhishek Das, Federico Carnevale, Hamza Merzic, Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Gregory Wayne, Felix Hill

Figure 1 for Probing Emergent Semantics in Predictive Agents via Question Answering
Figure 2 for Probing Emergent Semantics in Predictive Agents via Question Answering
Figure 3 for Probing Emergent Semantics in Predictive Agents via Question Answering
Figure 4 for Probing Emergent Semantics in Predictive Agents via Question Answering
Viaarxiv icon

Optimizing Agent Behavior over Long Time Scales by Transporting Value

Oct 15, 2018
Chia-Chun Hung, Timothy Lillicrap, Josh Abramson, Yan Wu, Mehdi Mirza, Federico Carnevale, Arun Ahuja, Greg Wayne

Figure 1 for Optimizing Agent Behavior over Long Time Scales by Transporting Value
Figure 2 for Optimizing Agent Behavior over Long Time Scales by Transporting Value
Figure 3 for Optimizing Agent Behavior over Long Time Scales by Transporting Value
Figure 4 for Optimizing Agent Behavior over Long Time Scales by Transporting Value
Viaarxiv icon

Unsupervised Predictive Memory in a Goal-Directed Agent

Mar 28, 2018
Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap

Figure 1 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 2 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 3 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 4 for Unsupervised Predictive Memory in a Goal-Directed Agent
Viaarxiv icon