Pieter Abbeel

UC Berkeley

Preference Transformer: Modeling Human Preferences using Transformers for RL

Mar 02, 2023
Via arXiv

Chain of Hindsight Aligns Language Models with Feedback

Feb 27, 2023
Via arXiv

Aligning Text-to-Image Models using Human Feedback

Feb 23, 2023
Via arXiv

Robust and Versatile Bipedal Jumping Control through Multi-Task Reinforcement Learning

Feb 19, 2023
Via arXiv

Guiding Pretraining in Reinforcement Learning with Large Language Models

Feb 13, 2023
Via arXiv

Controllability-Aware Unsupervised Skill Discovery

Feb 13, 2023
Via arXiv

The Wisdom of Hindsight Makes Language Models Better Instruction Followers

Feb 10, 2023
Via arXiv

Multi-View Masked World Models for Visual Robotic Manipulation

Feb 05, 2023
Via arXiv

Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment

Feb 03, 2023
Via arXiv

Learning Universal Policies via Text-Guided Video Generation

Feb 02, 2023
Via arXiv