Picture for Max Sobol Mark

Max Sobol Mark

Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone

Add code
Dec 09, 2024
Viaarxiv icon

Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning

Add code
Oct 23, 2023
Figure 1 for Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning
Figure 2 for Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning
Figure 3 for Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning
Figure 4 for Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning
Viaarxiv icon

Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias

Add code
Oct 12, 2023
Figure 1 for Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Figure 2 for Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Figure 3 for Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Figure 4 for Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Viaarxiv icon

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

Add code
Mar 09, 2023
Viaarxiv icon