Alert button
Picture for Aviral Kumar

Aviral Kumar

Alert button

OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 27, 2020
Anurag Ajay, Aviral Kumar, Pulkit Agrawal, Sergey Levine, Ofir Nachum

Figure 1 for OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Figure 2 for OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Figure 3 for OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Figure 4 for OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Viaarxiv icon

Conservative Q-Learning for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 29, 2020
Aviral Kumar, Aurick Zhou, George Tucker, Sergey Levine

Figure 1 for Conservative Q-Learning for Offline Reinforcement Learning
Figure 2 for Conservative Q-Learning for Offline Reinforcement Learning
Figure 3 for Conservative Q-Learning for Offline Reinforcement Learning
Figure 4 for Conservative Q-Learning for Offline Reinforcement Learning
Viaarxiv icon

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

Add code
Bookmark button
Alert button
May 04, 2020
Sergey Levine, Aviral Kumar, George Tucker, Justin Fu

Figure 1 for Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Figure 2 for Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Figure 3 for Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Figure 4 for Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Viaarxiv icon

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 20, 2020
Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine

Figure 1 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Figure 2 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Figure 3 for D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Viaarxiv icon

Datasets for Data-Driven Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 15, 2020
Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine

Figure 1 for Datasets for Data-Driven Reinforcement Learning
Figure 2 for Datasets for Data-Driven Reinforcement Learning
Figure 3 for Datasets for Data-Driven Reinforcement Learning
Viaarxiv icon

DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction

Add code
Bookmark button
Alert button
Mar 16, 2020
Aviral Kumar, Abhishek Gupta, Sergey Levine

Figure 1 for DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Figure 2 for DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Figure 3 for DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Figure 4 for DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Viaarxiv icon

Reward-Conditioned Policies

Add code
Bookmark button
Alert button
Dec 31, 2019
Aviral Kumar, Xue Bin Peng, Sergey Levine

Figure 1 for Reward-Conditioned Policies
Figure 2 for Reward-Conditioned Policies
Figure 3 for Reward-Conditioned Policies
Figure 4 for Reward-Conditioned Policies
Viaarxiv icon

Model Inversion Networks for Model-Based Optimization

Add code
Bookmark button
Alert button
Dec 31, 2019
Aviral Kumar, Sergey Levine

Figure 1 for Model Inversion Networks for Model-Based Optimization
Figure 2 for Model Inversion Networks for Model-Based Optimization
Figure 3 for Model Inversion Networks for Model-Based Optimization
Figure 4 for Model Inversion Networks for Model-Based Optimization
Viaarxiv icon

Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 07, 2019
Xue Bin Peng, Aviral Kumar, Grace Zhang, Sergey Levine

Figure 1 for Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Figure 2 for Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Figure 3 for Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Figure 4 for Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Viaarxiv icon