Picture for Sergey Levine

Sergey Levine

Stanford University

Stabilizing Contrastive RL: Techniques for Offline Goal Reaching

Add code
Jun 06, 2023
Figure 1 for Stabilizing Contrastive RL: Techniques for Offline Goal Reaching
Figure 2 for Stabilizing Contrastive RL: Techniques for Offline Goal Reaching
Figure 3 for Stabilizing Contrastive RL: Techniques for Offline Goal Reaching
Figure 4 for Stabilizing Contrastive RL: Techniques for Offline Goal Reaching
Viaarxiv icon

SACSoN: Scalable Autonomous Data Collection for Social Navigation

Add code
Jun 02, 2023
Viaarxiv icon

The False Promise of Imitating Proprietary LLMs

Add code
May 25, 2023
Figure 1 for The False Promise of Imitating Proprietary LLMs
Figure 2 for The False Promise of Imitating Proprietary LLMs
Figure 3 for The False Promise of Imitating Proprietary LLMs
Figure 4 for The False Promise of Imitating Proprietary LLMs
Viaarxiv icon

Training Diffusion Models with Reinforcement Learning

Add code
May 23, 2023
Figure 1 for Training Diffusion Models with Reinforcement Learning
Figure 2 for Training Diffusion Models with Reinforcement Learning
Figure 3 for Training Diffusion Models with Reinforcement Learning
Figure 4 for Training Diffusion Models with Reinforcement Learning
Viaarxiv icon

Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

Add code
May 05, 2023
Figure 1 for Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Figure 2 for Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Figure 3 for Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Figure 4 for Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Viaarxiv icon

Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

Add code
Apr 23, 2023
Figure 1 for Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Figure 2 for Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Figure 3 for Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Figure 4 for Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Viaarxiv icon

Efficient Deep Reinforcement Learning Requires Regulating Overfitting

Add code
Apr 20, 2023
Figure 1 for Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Figure 2 for Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Figure 3 for Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Figure 4 for Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Viaarxiv icon

IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies

Add code
Apr 20, 2023
Figure 1 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 2 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 3 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 4 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Viaarxiv icon

Learning and Adapting Agile Locomotion Skills by Transferring Experience

Add code
Apr 19, 2023
Figure 1 for Learning and Adapting Agile Locomotion Skills by Transferring Experience
Figure 2 for Learning and Adapting Agile Locomotion Skills by Transferring Experience
Figure 3 for Learning and Adapting Agile Locomotion Skills by Transferring Experience
Figure 4 for Learning and Adapting Agile Locomotion Skills by Transferring Experience
Viaarxiv icon

FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing

Add code
Apr 19, 2023
Figure 1 for FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing
Figure 2 for FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing
Figure 3 for FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing
Figure 4 for FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing
Viaarxiv icon