Zhaohui Tong

Actor-Accelerated Policy Dual Averaging for Reinforcement Learning in Continuous Action Spaces

Mar 10, 2026
Via arXiv