Picture for Hanyong Wang

Hanyong Wang

ExO-PPO: an Extended Off-policy Proximal Policy Optimization Algorithm

Add code
Feb 10, 2026
Viaarxiv icon