Picture for Marc Lanctot

Marc Lanctot

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections

Add code
May 24, 2022
Figure 1 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 2 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 3 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 4 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Viaarxiv icon

Anytime PSRO for Two-Player Zero-Sum Games

Add code
Jan 28, 2022
Figure 1 for Anytime PSRO for Two-Player Zero-Sum Games
Figure 2 for Anytime PSRO for Two-Player Zero-Sum Games
Figure 3 for Anytime PSRO for Two-Player Zero-Sum Games
Figure 4 for Anytime PSRO for Two-Player Zero-Sum Games
Viaarxiv icon

Player of Games

Add code
Dec 06, 2021
Figure 1 for Player of Games
Figure 2 for Player of Games
Figure 3 for Player of Games
Figure 4 for Player of Games
Viaarxiv icon

Dynamic population-based meta-learning for multi-agent communication with natural language

Add code
Oct 27, 2021
Figure 1 for Dynamic population-based meta-learning for multi-agent communication with natural language
Figure 2 for Dynamic population-based meta-learning for multi-agent communication with natural language
Figure 3 for Dynamic population-based meta-learning for multi-agent communication with natural language
Figure 4 for Dynamic population-based meta-learning for multi-agent communication with natural language
Viaarxiv icon

Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers

Add code
Jun 22, 2021
Figure 1 for Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Figure 2 for Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Figure 3 for Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Figure 4 for Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Viaarxiv icon

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games

Add code
Feb 13, 2021
Figure 1 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 2 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 3 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 4 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Viaarxiv icon

Solving Common-Payoff Games with Approximate Policy Iteration

Add code
Jan 11, 2021
Figure 1 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 2 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 3 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 4 for Solving Common-Payoff Games with Approximate Policy Iteration
Viaarxiv icon

Hindsight and Sequential Rationality of Correlated Play

Add code
Dec 17, 2020
Figure 1 for Hindsight and Sequential Rationality of Correlated Play
Figure 2 for Hindsight and Sequential Rationality of Correlated Play
Figure 3 for Hindsight and Sequential Rationality of Correlated Play
Figure 4 for Hindsight and Sequential Rationality of Correlated Play
Viaarxiv icon

Negotiating Team Formation Using Deep Reinforcement Learning

Add code
Oct 20, 2020
Figure 1 for Negotiating Team Formation Using Deep Reinforcement Learning
Figure 2 for Negotiating Team Formation Using Deep Reinforcement Learning
Figure 3 for Negotiating Team Formation Using Deep Reinforcement Learning
Figure 4 for Negotiating Team Formation Using Deep Reinforcement Learning
Viaarxiv icon

The Advantage Regret-Matching Actor-Critic

Add code
Aug 27, 2020
Figure 1 for The Advantage Regret-Matching Actor-Critic
Figure 2 for The Advantage Regret-Matching Actor-Critic
Figure 3 for The Advantage Regret-Matching Actor-Critic
Figure 4 for The Advantage Regret-Matching Actor-Critic
Viaarxiv icon