Pieter Abbeel

UC Berkeley

Chain of Hindsight Aligns Language Models with Feedback

Feb 27, 2023

Aligning Text-to-Image Models using Human Feedback

Feb 23, 2023

Robust and Versatile Bipedal Jumping Control through Multi-Task Reinforcement Learning

Feb 19, 2023

Guiding Pretraining in Reinforcement Learning with Large Language Models

Feb 13, 2023

Controllability-Aware Unsupervised Skill Discovery

Feb 13, 2023

The Wisdom of Hindsight Makes Language Models Better Instruction Followers

Feb 10, 2023

Multi-View Masked World Models for Visual Robotic Manipulation

Feb 05, 2023

Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment

Feb 03, 2023

Learning Universal Policies via Text-Guided Video Generation

Feb 02, 2023

Multi-Environment Pretraining Enables Transfer to Action Limited Datasets

Dec 05, 2022