Picture for Yujing Bian

Yujing Bian

Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation

Add code
Apr 06, 2026
Viaarxiv icon