Chensheng Dai

E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models

Jan 01, 2026
Via arXiv