Picture for Tuomas Sandholm

Tuomas Sandholm

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Add code
Jul 22, 2023
Viaarxiv icon

On the Convergence of No-Regret Learning Dynamics in Time-Varying Games

Add code
Jan 26, 2023
Figure 1 for On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
Figure 2 for On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
Figure 3 for On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
Figure 4 for On the Convergence of No-Regret Learning Dynamics in Time-Varying Games
Viaarxiv icon

Computing equilibria by minimizing exploitability with best-response ensembles

Add code
Jan 20, 2023
Viaarxiv icon

Finding mixed-strategy equilibria of continuous-action games without gradients using randomized policy networks

Add code
Nov 29, 2022
Viaarxiv icon

Near-Optimal $Φ$-Regret Learning in Extensive-Form Games

Add code
Aug 20, 2022
Figure 1 for Near-Optimal $Φ$-Regret Learning in Extensive-Form Games
Figure 2 for Near-Optimal $Φ$-Regret Learning in Extensive-Form Games
Viaarxiv icon

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

Add code
Jul 13, 2022
Figure 1 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 2 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 3 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 4 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Viaarxiv icon

Near-Optimal No-Regret Learning for General Convex Games

Add code
Jun 20, 2022
Figure 1 for Near-Optimal No-Regret Learning for General Convex Games
Viaarxiv icon

ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret

Add code
Jun 08, 2022
Figure 1 for ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Figure 2 for ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Figure 3 for ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Figure 4 for ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Viaarxiv icon

Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games

Add code
Apr 25, 2022
Figure 1 for Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games
Figure 2 for Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games
Figure 3 for Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games
Viaarxiv icon

Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts

Add code
Apr 15, 2022
Figure 1 for Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts
Figure 2 for Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts
Figure 3 for Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts
Figure 4 for Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts
Viaarxiv icon