Picture for Safwan Labbi

Safwan Labbi

Beyond Softmax and Entropy: Improving Convergence Guarantees of Policy Gradients by f-SoftArgmax Parameterization with Coupled Regularization

Add code
Jan 18, 2026
Viaarxiv icon

On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment

Add code
May 29, 2025
Figure 1 for On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment
Figure 2 for On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment
Figure 3 for On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment
Figure 4 for On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment
Viaarxiv icon

Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents

Add code
Oct 30, 2024
Figure 1 for Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Figure 2 for Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Figure 3 for Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Figure 4 for Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Viaarxiv icon

SCAFFLSA: Quantifying and Eliminating Heterogeneity Bias in Federated Linear Stochastic Approximation and Temporal Difference Learning

Add code
Feb 06, 2024
Viaarxiv icon