Zhichao Jia

Value Mirror Descent for Reinforcement Learning

Apr 07, 2026
Via arXiv