Picture for Pieter Abbeel

Pieter Abbeel

UC Berkeley

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates

Add code
Oct 28, 2021
Figure 1 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 2 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 3 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 4 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Viaarxiv icon

Towards More Generalizable One-shot Visual Imitation Learning

Add code
Oct 26, 2021
Figure 1 for Towards More Generalizable One-shot Visual Imitation Learning
Figure 2 for Towards More Generalizable One-shot Visual Imitation Learning
Figure 3 for Towards More Generalizable One-shot Visual Imitation Learning
Figure 4 for Towards More Generalizable One-shot Visual Imitation Learning
Viaarxiv icon

APS: Active Pretraining with Successor Features

Add code
Aug 31, 2021
Figure 1 for APS: Active Pretraining with Successor Features
Figure 2 for APS: Active Pretraining with Successor Features
Figure 3 for APS: Active Pretraining with Successor Features
Figure 4 for APS: Active Pretraining with Successor Features
Viaarxiv icon

Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback

Add code
Aug 11, 2021
Figure 1 for Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Figure 2 for Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Figure 3 for Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Figure 4 for Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Viaarxiv icon

Playful Interactions for Representation Learning

Add code
Jul 19, 2021
Figure 1 for Playful Interactions for Representation Learning
Figure 2 for Playful Interactions for Representation Learning
Figure 3 for Playful Interactions for Representation Learning
Figure 4 for Playful Interactions for Representation Learning
Viaarxiv icon

Hierarchical Few-Shot Imitation with Skill Transition Models

Add code
Jul 19, 2021
Figure 1 for Hierarchical Few-Shot Imitation with Skill Transition Models
Figure 2 for Hierarchical Few-Shot Imitation with Skill Transition Models
Figure 3 for Hierarchical Few-Shot Imitation with Skill Transition Models
Figure 4 for Hierarchical Few-Shot Imitation with Skill Transition Models
Viaarxiv icon

The MineRL BASALT Competition on Learning from Human Feedback

Add code
Jul 05, 2021
Figure 1 for The MineRL BASALT Competition on Learning from Human Feedback
Figure 2 for The MineRL BASALT Competition on Learning from Human Feedback
Viaarxiv icon

Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble

Add code
Jul 01, 2021
Figure 1 for Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Figure 2 for Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Figure 3 for Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Figure 4 for Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Viaarxiv icon

Decision Transformer: Reinforcement Learning via Sequence Modeling

Add code
Jun 24, 2021
Figure 1 for Decision Transformer: Reinforcement Learning via Sequence Modeling
Figure 2 for Decision Transformer: Reinforcement Learning via Sequence Modeling
Figure 3 for Decision Transformer: Reinforcement Learning via Sequence Modeling
Figure 4 for Decision Transformer: Reinforcement Learning via Sequence Modeling
Viaarxiv icon

Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments

Add code
Jun 18, 2021
Figure 1 for Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments
Figure 2 for Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments
Figure 3 for Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments
Figure 4 for Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments
Viaarxiv icon