Picture for Pengcuo Zeren

Pengcuo Zeren

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Add code
Apr 08, 2026
Viaarxiv icon