Picture for Martin Schmid

Martin Schmid

Artificial Generals Intelligence: Mastering Generals.io with Reinforcement Learning

Add code
Jul 09, 2025
Viaarxiv icon

Meta-Learning in Self-Play Regret Minimization

Add code
Apr 26, 2025
Viaarxiv icon

Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents

Add code
Apr 25, 2024
Figure 1 for Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents
Figure 2 for Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents
Figure 3 for Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents
Figure 4 for Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents
Viaarxiv icon

Learning not to Regret

Add code
Mar 02, 2023
Figure 1 for Learning not to Regret
Figure 2 for Learning not to Regret
Figure 3 for Learning not to Regret
Figure 4 for Learning not to Regret
Viaarxiv icon

Player of Games

Add code
Dec 06, 2021
Figure 1 for Player of Games
Figure 2 for Player of Games
Figure 3 for Player of Games
Figure 4 for Player of Games
Viaarxiv icon

Search in Imperfect Information Games

Add code
Nov 10, 2021
Figure 1 for Search in Imperfect Information Games
Figure 2 for Search in Imperfect Information Games
Figure 3 for Search in Imperfect Information Games
Figure 4 for Search in Imperfect Information Games
Viaarxiv icon

Solving Common-Payoff Games with Approximate Policy Iteration

Add code
Jan 11, 2021
Figure 1 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 2 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 3 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 4 for Solving Common-Payoff Games with Approximate Policy Iteration
Viaarxiv icon

The Advantage Regret-Matching Actor-Critic

Add code
Aug 27, 2020
Figure 1 for The Advantage Regret-Matching Actor-Critic
Figure 2 for The Advantage Regret-Matching Actor-Critic
Figure 3 for The Advantage Regret-Matching Actor-Critic
Figure 4 for The Advantage Regret-Matching Actor-Critic
Viaarxiv icon

Approximate exploitability: Learning a best response in large games

Add code
Apr 20, 2020
Figure 1 for Approximate exploitability: Learning a best response in large games
Figure 2 for Approximate exploitability: Learning a best response in large games
Figure 3 for Approximate exploitability: Learning a best response in large games
Figure 4 for Approximate exploitability: Learning a best response in large games
Viaarxiv icon

Low-Variance and Zero-Variance Baselines for Extensive-Form Games

Add code
Jul 22, 2019
Figure 1 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Figure 2 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Figure 3 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Viaarxiv icon