Picture for Mahdi Farahbakhsh

Mahdi Farahbakhsh

Reinforcement Learning for Diffusion LLMs with Entropy-Guided Step Selection and Stepwise Advantages

Add code
Mar 13, 2026
Viaarxiv icon