Picture for Sergey Levine

Sergey Levine

Stanford University

Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning

Add code
Nov 07, 2024
Figure 1 for Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Figure 2 for Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Figure 3 for Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Figure 4 for Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Viaarxiv icon

Learning to Assist Humans without Inferring Rewards

Add code
Nov 04, 2024
Viaarxiv icon

$π_0$: A Vision-Language-Action Flow Model for General Robot Control

Add code
Oct 31, 2024
Figure 1 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 2 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 3 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 4 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Viaarxiv icon

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

Add code
Oct 29, 2024
Viaarxiv icon

OGBench: Benchmarking Offline Goal-Conditioned RL

Add code
Oct 26, 2024
Viaarxiv icon

GHIL-Glue: Hierarchical Control with Filtered Subgoal Images

Add code
Oct 26, 2024
Figure 1 for GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Figure 2 for GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Figure 3 for GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Figure 4 for GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Viaarxiv icon

Prioritized Generative Replay

Add code
Oct 23, 2024
Viaarxiv icon

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

Add code
Oct 23, 2024
Figure 1 for Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Figure 2 for Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Figure 3 for Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Figure 4 for Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Viaarxiv icon

Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design

Add code
Oct 17, 2024
Figure 1 for Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
Figure 2 for Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
Figure 3 for Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
Figure 4 for Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
Viaarxiv icon

Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance

Add code
Oct 17, 2024
Figure 1 for Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Figure 2 for Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Figure 3 for Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Figure 4 for Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Viaarxiv icon