Picture for Michael Bowling

Michael Bowling

Over-communicate no more: Situated RL agents learn concise communication protocols

Add code
Nov 02, 2022
Figure 1 for Over-communicate no more: Situated RL agents learn concise communication protocols
Figure 2 for Over-communicate no more: Situated RL agents learn concise communication protocols
Figure 3 for Over-communicate no more: Situated RL agents learn concise communication protocols
Figure 4 for Over-communicate no more: Situated RL agents learn concise communication protocols
Viaarxiv icon

Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration

Add code
Jun 04, 2022
Figure 1 for Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration
Viaarxiv icon

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections

Add code
May 24, 2022
Figure 1 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 2 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 3 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 4 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Viaarxiv icon

Should Models Be Accurate?

Add code
May 22, 2022
Figure 1 for Should Models Be Accurate?
Viaarxiv icon

Player of Games

Add code
Dec 06, 2021
Figure 1 for Player of Games
Figure 2 for Player of Games
Figure 3 for Player of Games
Figure 4 for Player of Games
Viaarxiv icon

The Partially Observable History Process

Add code
Nov 15, 2021
Figure 1 for The Partially Observable History Process
Viaarxiv icon

Learning to Be Cautious

Add code
Oct 29, 2021
Figure 1 for Learning to Be Cautious
Figure 2 for Learning to Be Cautious
Figure 3 for Learning to Be Cautious
Figure 4 for Learning to Be Cautious
Viaarxiv icon

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games

Add code
Feb 13, 2021
Figure 1 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 2 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 3 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 4 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Viaarxiv icon

Solving Common-Payoff Games with Approximate Policy Iteration

Add code
Jan 11, 2021
Figure 1 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 2 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 3 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 4 for Solving Common-Payoff Games with Approximate Policy Iteration
Viaarxiv icon

Hindsight and Sequential Rationality of Correlated Play

Add code
Dec 17, 2020
Figure 1 for Hindsight and Sequential Rationality of Correlated Play
Figure 2 for Hindsight and Sequential Rationality of Correlated Play
Figure 3 for Hindsight and Sequential Rationality of Correlated Play
Figure 4 for Hindsight and Sequential Rationality of Correlated Play
Viaarxiv icon