Safwan Labbi

Beyond Softmax and Entropy: Improving Convergence Guarantees of Policy Gradients by f-SoftArgmax Parameterization with Coupled Regularization

Jan 18, 2026
Via arXiv

On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment

May 29, 2025
Via arXiv

Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents

Oct 30, 2024
Via arXiv

SCAFFLSA: Quantifying and Eliminating Heterogeneity Bias in Federated Linear Stochastic Approximation and Temporal Difference Learning

Feb 06, 2024
Via arXiv