Picture for Pieter Abbeel

Pieter Abbeel

UC Berkeley

Compute-Optimal Scaling for Value-Based Deep RL

Add code
Aug 20, 2025
Viaarxiv icon

MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real

Add code
Jul 03, 2025
Viaarxiv icon

SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill Blending

Add code
Jun 11, 2025
Viaarxiv icon

A Stable Whitening Optimizer for Efficient Neural Network Training

Add code
Jun 08, 2025
Figure 1 for A Stable Whitening Optimizer for Efficient Neural Network Training
Figure 2 for A Stable Whitening Optimizer for Efficient Neural Network Training
Figure 3 for A Stable Whitening Optimizer for Efficient Neural Network Training
Figure 4 for A Stable Whitening Optimizer for Efficient Neural Network Training
Viaarxiv icon

Object-centric 3D Motion Field for Robot Learning from Human Videos

Add code
Jun 04, 2025
Viaarxiv icon

Diffusion Guidance Is a Controllable Policy Improvement Operator

Add code
May 29, 2025
Figure 1 for Diffusion Guidance Is a Controllable Policy Improvement Operator
Figure 2 for Diffusion Guidance Is a Controllable Policy Improvement Operator
Figure 3 for Diffusion Guidance Is a Controllable Policy Improvement Operator
Figure 4 for Diffusion Guidance Is a Controllable Policy Improvement Operator
Viaarxiv icon

FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control

Add code
May 29, 2025
Viaarxiv icon

Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners

Add code
May 29, 2025
Viaarxiv icon

EgoZero: Robot Learning from Smart Glasses

Add code
May 26, 2025
Figure 1 for EgoZero: Robot Learning from Smart Glasses
Figure 2 for EgoZero: Robot Learning from Smart Glasses
Figure 3 for EgoZero: Robot Learning from Smart Glasses
Figure 4 for EgoZero: Robot Learning from Smart Glasses
Viaarxiv icon

DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy

Add code
May 19, 2025
Viaarxiv icon