Picture for Deyi Wang

Deyi Wang

Efficient Federated RLHF via Zeroth-Order Policy Optimization

Add code
Apr 20, 2026
Viaarxiv icon