Picture for Andy Peng

Andy Peng

Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning

Add code
Jun 09, 2026
Viaarxiv icon

Fisher-Orthogonal Projected Natural Gradient Descent for Continual Learning

Add code
Jan 19, 2026
Viaarxiv icon

Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data

Add code
Dec 10, 2024
Figure 1 for Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Figure 2 for Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Figure 3 for Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Figure 4 for Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Viaarxiv icon