Picture for Pieter Abbeel

Pieter Abbeel

UC Berkeley

Coarse-to-Fine Q-attention with Learned Path Ranking

Add code
Apr 04, 2022
Figure 1 for Coarse-to-Fine Q-attention with Learned Path Ranking
Figure 2 for Coarse-to-Fine Q-attention with Learned Path Ranking
Figure 3 for Coarse-to-Fine Q-attention with Learned Path Ranking
Figure 4 for Coarse-to-Fine Q-attention with Learned Path Ranking
Viaarxiv icon

Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling and Design

Add code
Apr 01, 2022
Figure 1 for Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling and Design
Figure 2 for Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling and Design
Figure 3 for Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling and Design
Figure 4 for Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling and Design
Viaarxiv icon

Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions

Add code
Mar 28, 2022
Figure 1 for Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions
Figure 2 for Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions
Figure 3 for Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions
Figure 4 for Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions
Viaarxiv icon

Reinforcement Learning with Action-Free Pre-Training from Videos

Add code
Mar 25, 2022
Figure 1 for Reinforcement Learning with Action-Free Pre-Training from Videos
Figure 2 for Reinforcement Learning with Action-Free Pre-Training from Videos
Figure 3 for Reinforcement Learning with Action-Free Pre-Training from Videos
Figure 4 for Reinforcement Learning with Action-Free Pre-Training from Videos
Viaarxiv icon

Teachable Reinforcement Learning via Advice Distillation

Add code
Mar 19, 2022
Figure 1 for Teachable Reinforcement Learning via Advice Distillation
Figure 2 for Teachable Reinforcement Learning via Advice Distillation
Figure 3 for Teachable Reinforcement Learning via Advice Distillation
Figure 4 for Teachable Reinforcement Learning via Advice Distillation
Viaarxiv icon

SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning

Add code
Mar 18, 2022
Figure 1 for SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Figure 2 for SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Figure 3 for SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Figure 4 for SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Viaarxiv icon

It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation

Add code
Feb 22, 2022
Figure 1 for It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
Figure 2 for It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
Figure 3 for It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
Figure 4 for It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
Viaarxiv icon

Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning

Add code
Feb 08, 2022
Figure 1 for Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Figure 2 for Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Figure 3 for Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Figure 4 for Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Viaarxiv icon

Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning

Add code
Feb 08, 2022
Viaarxiv icon

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

Add code
Feb 01, 2022
Figure 1 for CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Figure 2 for CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Figure 3 for CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Figure 4 for CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Viaarxiv icon