Picture for Phalguni Nanda

Phalguni Nanda

Natural Policy Gradient as Doubly Smoothed Policy Iteration: A Bellman-Operator Framework

Add code
May 11, 2026
Viaarxiv icon