Picture for Gabriele Farina

Gabriele Farina

Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms

Add code
Jun 15, 2024
Viaarxiv icon

Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property

Add code
Dec 21, 2023
Viaarxiv icon

Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning

Add code
Nov 16, 2023
Viaarxiv icon

Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

Add code
Nov 01, 2023
Figure 1 for Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games
Figure 2 for Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games
Figure 3 for Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games
Viaarxiv icon

The Consensus Game: Language Model Generation via Equilibrium Search

Add code
Oct 13, 2023
Figure 1 for The Consensus Game: Language Model Generation via Equilibrium Search
Figure 2 for The Consensus Game: Language Model Generation via Equilibrium Search
Figure 3 for The Consensus Game: Language Model Generation via Equilibrium Search
Figure 4 for The Consensus Game: Language Model Generation via Equilibrium Search
Viaarxiv icon

Regret Matching+: (In)Stability and Fast Convergence in Games

Add code
May 24, 2023
Figure 1 for Regret Matching+: (In)Stability and Fast Convergence in Games
Figure 2 for Regret Matching+: (In)Stability and Fast Convergence in Games
Figure 3 for Regret Matching+: (In)Stability and Fast Convergence in Games
Figure 4 for Regret Matching+: (In)Stability and Fast Convergence in Games
Viaarxiv icon

The Update Equivalence Framework for Decision-Time Planning

Add code
Apr 25, 2023
Figure 1 for The Update Equivalence Framework for Decision-Time Planning
Figure 2 for The Update Equivalence Framework for Decision-Time Planning
Figure 3 for The Update Equivalence Framework for Decision-Time Planning
Figure 4 for The Update Equivalence Framework for Decision-Time Planning
Viaarxiv icon

On the Convergence of No-Regret Learning Dynamics in Time-Varying Games

Add code
Jan 26, 2023
Figure 1 for On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
Figure 2 for On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
Figure 3 for On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
Figure 4 for On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
Viaarxiv icon

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Add code
Oct 11, 2022
Figure 1 for Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Figure 2 for Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Figure 3 for Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Figure 4 for Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Viaarxiv icon

Near-Optimal $Φ$-Regret Learning in Extensive-Form Games

Add code
Aug 20, 2022
Figure 1 for Near-Optimal $Φ$-Regret Learning in Extensive-Form Games
Figure 2 for Near-Optimal $Φ$-Regret Learning in Extensive-Form Games
Viaarxiv icon