Picture for Lorenzo Mancini

Lorenzo Mancini

Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games

Add code
May 28, 2025
Viaarxiv icon

Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents

Add code
Oct 30, 2024
Figure 1 for Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Figure 2 for Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Figure 3 for Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Figure 4 for Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Viaarxiv icon