Picture for Wenjun Cao

Wenjun Cao

Fight Fire with Fire: Defending Against Malicious RL Fine-Tuning via Reward Neutralization

Add code
May 07, 2025
Viaarxiv icon

Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance

Add code
Apr 26, 2025
Viaarxiv icon