Picture for Alistair Muldal

Alistair Muldal

Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Add code
Nov 21, 2022
Figure 1 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 2 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 3 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Figure 4 for Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Viaarxiv icon

Intra-agent speech permits zero-shot task acquisition

Add code
Jun 07, 2022
Figure 1 for Intra-agent speech permits zero-shot task acquisition
Figure 2 for Intra-agent speech permits zero-shot task acquisition
Figure 3 for Intra-agent speech permits zero-shot task acquisition
Figure 4 for Intra-agent speech permits zero-shot task acquisition
Viaarxiv icon

Evaluating Multimodal Interactive Agents

Add code
May 26, 2022
Figure 1 for Evaluating Multimodal Interactive Agents
Figure 2 for Evaluating Multimodal Interactive Agents
Figure 3 for Evaluating Multimodal Interactive Agents
Figure 4 for Evaluating Multimodal Interactive Agents
Viaarxiv icon

A data-driven approach for learning to control computers

Add code
Feb 16, 2022
Figure 1 for A data-driven approach for learning to control computers
Figure 2 for A data-driven approach for learning to control computers
Figure 3 for A data-driven approach for learning to control computers
Figure 4 for A data-driven approach for learning to control computers
Viaarxiv icon

Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning

Add code
Dec 07, 2021
Figure 1 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 2 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 3 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Figure 4 for Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
Viaarxiv icon

Imitating Interactive Intelligence

Add code
Jan 21, 2021
Figure 1 for Imitating Interactive Intelligence
Figure 2 for Imitating Interactive Intelligence
Figure 3 for Imitating Interactive Intelligence
Figure 4 for Imitating Interactive Intelligence
Viaarxiv icon

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

Add code
Sep 11, 2020
Figure 1 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 2 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 3 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 4 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Viaarxiv icon

dm_control: Software and Tasks for Continuous Control

Add code
Jun 22, 2020
Figure 1 for dm_control: Software and Tasks for Continuous Control
Figure 2 for dm_control: Software and Tasks for Continuous Control
Figure 3 for dm_control: Software and Tasks for Continuous Control
Figure 4 for dm_control: Software and Tasks for Continuous Control
Viaarxiv icon

Distributed Distributional Deterministic Policy Gradients

Add code
Apr 23, 2018
Figure 1 for Distributed Distributional Deterministic Policy Gradients
Figure 2 for Distributed Distributional Deterministic Policy Gradients
Figure 3 for Distributed Distributional Deterministic Policy Gradients
Figure 4 for Distributed Distributional Deterministic Policy Gradients
Viaarxiv icon

Learning Awareness Models

Add code
Apr 17, 2018
Figure 1 for Learning Awareness Models
Figure 2 for Learning Awareness Models
Figure 3 for Learning Awareness Models
Figure 4 for Learning Awareness Models
Viaarxiv icon