Pieter Abbeel

UC Berkeley

Explaining Reinforcement Learning Policies through Counterfactual Trajectories

Jan 29, 2022
Via arXiv

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents

Jan 18, 2022
Via arXiv

Target Entropy Annealing for Discrete Soft Actor-Critic

Dec 06, 2021
Via arXiv

Zero-Shot Text-Guided Object Generation with Dream Fields

Dec 02, 2021
Via arXiv

Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL

Dec 02, 2021
Via arXiv

Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning

Nov 28, 2021
Via arXiv

Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning

Nov 04, 2021
Via arXiv

B-Pref: Benchmarking Preference-Based Reinforcement Learning

Nov 04, 2021
Via arXiv

Mastering Atari Games with Limited Data

Oct 30, 2021
Via arXiv

URLB: Unsupervised Reinforcement Learning Benchmark

Oct 28, 2021
Via arXiv