Gabriele Farina

Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property

Dec 21, 2023
Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning

Nov 16, 2023
Athul Paul Jacob, Gabriele Farina, Jacob Andreas

Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

Nov 01, 2023
Yang Cai, Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Weiqiang Zheng

The Consensus Game: Language Model Generation via Equilibrium Search

Oct 13, 2023
Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas

Regret Matching+: (In)Stability and Fast Convergence in Games

May 24, 2023
Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo

The Update Equivalence Framework for Decision-Time Planning

Apr 25, 2023
Samuel Sokota, Gabriele Farina, David J. Wu, Hengyuan Hu, Kevin A. Wang, J. Zico Kolter, Noam Brown

On the Convergence of No-Regret Learning Dynamics in Time-Varying Games

Jan 26, 2023
Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Oct 11, 2022
Anton Bakhtin, David J Wu, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H Miller, Noam Brown

Near-Optimal $Φ$-Regret Learning in Extensive-Form Games

Aug 20, 2022
Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

Near-Optimal No-Regret Learning for General Convex Games

Jun 20, 2022
Gabriele Farina, Ioannis Anagnostides, Haipeng Luo, Chung-Wei Lee, Christian Kroer, Tuomas Sandholm
