Picture for Sergey Levine

Sergey Levine

Stanford University

Is Value Learning Really the Main Bottleneck in Offline RL?

Add code
Jun 13, 2024
Viaarxiv icon

Language Guided Skill Discovery

Add code
Jun 07, 2024
Figure 1 for Language Guided Skill Discovery
Figure 2 for Language Guided Skill Discovery
Figure 3 for Language Guided Skill Discovery
Figure 4 for Language Guided Skill Discovery
Viaarxiv icon

Strategically Conservative Q-Learning

Add code
Jun 06, 2024
Figure 1 for Strategically Conservative Q-Learning
Figure 2 for Strategically Conservative Q-Learning
Figure 3 for Strategically Conservative Q-Learning
Figure 4 for Strategically Conservative Q-Learning
Viaarxiv icon

Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

Add code
May 31, 2024
Figure 1 for Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
Figure 2 for Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
Figure 3 for Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
Figure 4 for Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
Viaarxiv icon

Octo: An Open-Source Generalist Robot Policy

Add code
May 20, 2024
Figure 1 for Octo: An Open-Source Generalist Robot Policy
Figure 2 for Octo: An Open-Source Generalist Robot Policy
Figure 3 for Octo: An Open-Source Generalist Robot Policy
Figure 4 for Octo: An Open-Source Generalist Robot Policy
Viaarxiv icon

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Add code
May 17, 2024
Figure 1 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 2 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 3 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 4 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Viaarxiv icon

Evaluating Real-World Robot Manipulation Policies in Simulation

Add code
May 09, 2024
Figure 1 for Evaluating Real-World Robot Manipulation Policies in Simulation
Figure 2 for Evaluating Real-World Robot Manipulation Policies in Simulation
Figure 3 for Evaluating Real-World Robot Manipulation Policies in Simulation
Figure 4 for Evaluating Real-World Robot Manipulation Policies in Simulation
Viaarxiv icon

RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes

Add code
May 07, 2024
Figure 1 for RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Figure 2 for RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Figure 3 for RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Figure 4 for RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Viaarxiv icon

Learning Visuotactile Skills with Two Multifingered Hands

Add code
Apr 25, 2024
Figure 1 for Learning Visuotactile Skills with Two Multifingered Hands
Figure 2 for Learning Visuotactile Skills with Two Multifingered Hands
Figure 3 for Learning Visuotactile Skills with Two Multifingered Hands
Figure 4 for Learning Visuotactile Skills with Two Multifingered Hands
Viaarxiv icon

Autonomous Evaluation and Refinement of Digital Agents

Add code
Apr 10, 2024
Viaarxiv icon