Picture for Nathan Tsang

Nathan Tsang

EMAgnet: Parameter-Space EMA Regularization for Policy Gradient Self-Play in Large Games

Add code
Jun 22, 2026
Viaarxiv icon