Lorenzo Mancini

Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games

May 28, 2025
Via arXiv

Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents

Oct 30, 2024
Via arXiv