Alert button
Picture for Pieter Abbeel

Pieter Abbeel

Alert button

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents

Add code
Bookmark button
Alert button
Jan 18, 2022
Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch

Figure 1 for Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
Figure 2 for Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
Figure 3 for Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
Figure 4 for Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
Viaarxiv icon

Target Entropy Annealing for Discrete Soft Actor-Critic

Add code
Bookmark button
Alert button
Dec 06, 2021
Yaosheng Xu, Dailin Hu, Litian Liang, Stephen McAleer, Pieter Abbeel, Roy Fox

Figure 1 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 2 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 3 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 4 for Target Entropy Annealing for Discrete Soft Actor-Critic
Viaarxiv icon

Zero-Shot Text-Guided Object Generation with Dream Fields

Add code
Bookmark button
Alert button
Dec 02, 2021
Ajay Jain, Ben Mildenhall, Jonathan T. Barron, Pieter Abbeel, Ben Poole

Figure 1 for Zero-Shot Text-Guided Object Generation with Dream Fields
Figure 2 for Zero-Shot Text-Guided Object Generation with Dream Fields
Figure 3 for Zero-Shot Text-Guided Object Generation with Dream Fields
Figure 4 for Zero-Shot Text-Guided Object Generation with Dream Fields
Viaarxiv icon

Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL

Add code
Bookmark button
Alert button
Dec 02, 2021
Charles Packer, Pieter Abbeel, Joseph E. Gonzalez

Figure 1 for Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Figure 2 for Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Figure 3 for Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Figure 4 for Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Viaarxiv icon

Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 28, 2021
Dailin Hu, Pieter Abbeel, Roy Fox

Figure 1 for Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Figure 2 for Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Figure 3 for Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Figure 4 for Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Viaarxiv icon

Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning

Add code
Bookmark button
Alert button
Nov 04, 2021
Wenlong Huang, Igor Mordatch, Pieter Abbeel, Deepak Pathak

Figure 1 for Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Figure 2 for Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Figure 3 for Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Figure 4 for Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Viaarxiv icon

B-Pref: Benchmarking Preference-Based Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 04, 2021
Kimin Lee, Laura Smith, Anca Dragan, Pieter Abbeel

Figure 1 for B-Pref: Benchmarking Preference-Based Reinforcement Learning
Figure 2 for B-Pref: Benchmarking Preference-Based Reinforcement Learning
Figure 3 for B-Pref: Benchmarking Preference-Based Reinforcement Learning
Figure 4 for B-Pref: Benchmarking Preference-Based Reinforcement Learning
Viaarxiv icon

Mastering Atari Games with Limited Data

Add code
Bookmark button
Alert button
Oct 30, 2021
Weirui Ye, Shaohuai Liu, Thanard Kurutach, Pieter Abbeel, Yang Gao

Figure 1 for Mastering Atari Games with Limited Data
Figure 2 for Mastering Atari Games with Limited Data
Figure 3 for Mastering Atari Games with Limited Data
Figure 4 for Mastering Atari Games with Limited Data
Viaarxiv icon

URLB: Unsupervised Reinforcement Learning Benchmark

Add code
Bookmark button
Alert button
Oct 28, 2021
Michael Laskin, Denis Yarats, Hao Liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel

Figure 1 for URLB: Unsupervised Reinforcement Learning Benchmark
Figure 2 for URLB: Unsupervised Reinforcement Learning Benchmark
Figure 3 for URLB: Unsupervised Reinforcement Learning Benchmark
Figure 4 for URLB: Unsupervised Reinforcement Learning Benchmark
Viaarxiv icon

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates

Add code
Bookmark button
Alert button
Oct 28, 2021
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

Figure 1 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 2 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 3 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 4 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Viaarxiv icon