Picture for Marc Lanctot

Marc Lanctot

Learning to Play No-Press Diplomacy with Best Response Policy Iteration

Add code
Jun 17, 2020
Figure 1 for Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Figure 2 for Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Figure 3 for Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Figure 4 for Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Viaarxiv icon

Approximate exploitability: Learning a best response in large games

Add code
Apr 20, 2020
Figure 1 for Approximate exploitability: Learning a best response in large games
Figure 2 for Approximate exploitability: Learning a best response in large games
Figure 3 for Approximate exploitability: Learning a best response in large games
Figure 4 for Approximate exploitability: Learning a best response in large games
Viaarxiv icon

From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

Add code
Feb 19, 2020
Figure 1 for From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Figure 2 for From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Figure 3 for From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Figure 4 for From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Viaarxiv icon

OpenSpiel: A Framework for Reinforcement Learning in Games

Add code
Oct 10, 2019
Figure 1 for OpenSpiel: A Framework for Reinforcement Learning in Games
Figure 2 for OpenSpiel: A Framework for Reinforcement Learning in Games
Figure 3 for OpenSpiel: A Framework for Reinforcement Learning in Games
Figure 4 for OpenSpiel: A Framework for Reinforcement Learning in Games
Viaarxiv icon

A Generalized Training Approach for Multiagent Learning

Add code
Sep 27, 2019
Figure 1 for A Generalized Training Approach for Multiagent Learning
Figure 2 for A Generalized Training Approach for Multiagent Learning
Figure 3 for A Generalized Training Approach for Multiagent Learning
Figure 4 for A Generalized Training Approach for Multiagent Learning
Viaarxiv icon

Neural Replicator Dynamics

Add code
Jun 01, 2019
Figure 1 for Neural Replicator Dynamics
Figure 2 for Neural Replicator Dynamics
Figure 3 for Neural Replicator Dynamics
Figure 4 for Neural Replicator Dynamics
Viaarxiv icon

Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent

Add code
Mar 21, 2019
Figure 1 for Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent
Viaarxiv icon

Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research

Add code
Mar 11, 2019
Figure 1 for Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Figure 2 for Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Viaarxiv icon

The Hanabi Challenge: A New Frontier for AI Research

Add code
Feb 01, 2019
Figure 1 for The Hanabi Challenge: A New Frontier for AI Research
Figure 2 for The Hanabi Challenge: A New Frontier for AI Research
Figure 3 for The Hanabi Challenge: A New Frontier for AI Research
Figure 4 for The Hanabi Challenge: A New Frontier for AI Research
Viaarxiv icon

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments

Add code
Oct 21, 2018
Figure 1 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 2 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 3 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 4 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Viaarxiv icon