Picture for Max Qiushi Lin

Max Qiushi Lin

Rethinking the Global Convergence of Softmax Policy Gradient with Linear Function Approximation

Add code
May 06, 2025
Viaarxiv icon