Picture for Zedong Dan

Zedong Dan

Targeted Exploration via Unified Entropy Control for Reinforcement Learning

Add code
Apr 16, 2026
Viaarxiv icon