Andy Peng

Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data

Dec 10, 2024
Via arXiv