Picture for Pieter Abbeel

Pieter Abbeel

UC Berkeley

Functional Graphical Models: Structure Enables Offline Data-Driven Optimization

Add code
Jan 12, 2024
Figure 1 for Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
Figure 2 for Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
Figure 3 for Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
Figure 4 for Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
Viaarxiv icon

Any-point Trajectory Modeling for Policy Learning

Add code
Dec 28, 2023
Figure 1 for Any-point Trajectory Modeling for Policy Learning
Figure 2 for Any-point Trajectory Modeling for Policy Learning
Figure 3 for Any-point Trajectory Modeling for Policy Learning
Figure 4 for Any-point Trajectory Modeling for Policy Learning
Viaarxiv icon

Learning a Diffusion Model Policy from Rewards via Q-Score Matching

Add code
Dec 18, 2023
Figure 1 for Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Figure 2 for Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Figure 3 for Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Figure 4 for Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Viaarxiv icon

Motion-Conditioned Image Animation for Video Editing

Add code
Nov 30, 2023
Figure 1 for Motion-Conditioned Image Animation for Video Editing
Figure 2 for Motion-Conditioned Image Animation for Video Editing
Figure 3 for Motion-Conditioned Image Animation for Video Editing
Figure 4 for Motion-Conditioned Image Animation for Video Editing
Viaarxiv icon

AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation

Add code
Nov 03, 2023
Figure 1 for AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
Figure 2 for AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
Figure 3 for AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
Figure 4 for AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
Viaarxiv icon

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

Add code
Nov 02, 2023
Viaarxiv icon

The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning

Add code
Nov 02, 2023
Figure 1 for The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning
Figure 2 for The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning
Figure 3 for The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning
Figure 4 for The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning
Viaarxiv icon

DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing

Add code
Nov 02, 2023
Figure 1 for DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing
Figure 2 for DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing
Figure 3 for DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing
Figure 4 for DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing
Viaarxiv icon

Managing AI Risks in an Era of Rapid Progress

Add code
Oct 26, 2023
Viaarxiv icon

Video Language Planning

Add code
Oct 16, 2023
Viaarxiv icon