Picture for Xiaoshuang Wang

Xiaoshuang Wang

Policy Gradient Optimzation for Bayesian-Risk MDPs with General Convex Losses

Add code
Sep 19, 2025
Viaarxiv icon