Zhaowei Hong

Revisiting Zeroth-Order Hessian Approximation: A Single-Step Policy Optimization Lens

May 29, 2026
Via arXiv