Pieter Abbeel

UC Berkeley

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

Sep 16, 2022

HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator

Sep 15, 2022

Multi-Objective Policy Gradients with Topological Constraints

Sep 15, 2022

AdaCat: Adaptive Categorical Discretization for Autoregressive Models

Aug 03, 2022

Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision

Jun 29, 2022

Masked World Models for Visual Control

Jun 28, 2022

DayDreamer: World Models for Physical Robot Learning

Jun 28, 2022

Patch-based Object-centric Transformers for Efficient Video Generation

Jun 19, 2022

Deep Hierarchical Planning from Pixels

Jun 08, 2022

On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

Jun 07, 2022