Sergey Levine

University of California, Berkeley

Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes

Nov 28, 2022 (via arXiv)

Data-Driven Offline Decision-Making via Invariant Representation Learning

Nov 25, 2022 (via arXiv)

Robotic Skill Acquisition via Instruction Augmentation with Vision-Language Models

Nov 22, 2022 (via arXiv)

Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints

Nov 21, 2022 (via arXiv)

Dual Generator Offline Reinforcement Learning

Nov 02, 2022 (via arXiv)

Adversarial Policies Beat Professional-Level Go AIs

Nov 01, 2022 (via arXiv)

Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision

Oct 27, 2022 (via arXiv)

FCM: Forgetful Causal Masking Makes Causal Language Models Better Zero-Shot Learners

Oct 24, 2022 (via arXiv)

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity

Oct 18, 2022 (via arXiv)

You Only Live Once: Single-Life Reinforcement Learning

Oct 17, 2022 (via arXiv)