Alert button
Picture for Max Sobol Mark

Max Sobol Mark

Alert button

Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 23, 2023
Jingyun Yang, Max Sobol Mark, Brandon Vu, Archit Sharma, Jeannette Bohg, Chelsea Finn

Viaarxiv icon

Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias

Add code
Bookmark button
Alert button
Oct 12, 2023
Max Sobol Mark, Archit Sharma, Fahim Tajwar, Rafael Rafailov, Sergey Levine, Chelsea Finn

Figure 1 for Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Figure 2 for Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Figure 3 for Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Figure 4 for Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Viaarxiv icon

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

Add code
Bookmark button
Alert button
Mar 09, 2023
Mitsuhiko Nakamoto, Yuexiang Zhai, Anikait Singh, Max Sobol Mark, Yi Ma, Chelsea Finn, Aviral Kumar, Sergey Levine

Figure 1 for Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Figure 2 for Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Figure 3 for Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Figure 4 for Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Viaarxiv icon