Picture for Max Qiushi Lin

Max Qiushi Lin

Optimistic Actor-Critic with Parametric Policies for Linear Markov Decision Processes

Add code
Apr 01, 2026
Viaarxiv icon

Rethinking the Global Convergence of Softmax Policy Gradient with Linear Function Approximation

Add code
May 06, 2025
Viaarxiv icon