Alert button
Picture for Michael Bowling

Michael Bowling

Alert button

Should Models Be Accurate?

May 22, 2022
Esra'a Saleh, John D. Martin, Anna Koop, Arash Pourzarabi, Michael Bowling

Figure 1 for Should Models Be Accurate?
Viaarxiv icon

Player of Games

Dec 06, 2021
Martin Schmid, Matej Moravcik, Neil Burch, Rudolf Kadlec, Josh Davidson, Kevin Waugh, Nolan Bard, Finbarr Timbers, Marc Lanctot, Zach Holland, Elnaz Davoodi, Alden Christianson, Michael Bowling

Figure 1 for Player of Games
Figure 2 for Player of Games
Figure 3 for Player of Games
Figure 4 for Player of Games
Viaarxiv icon

The Partially Observable History Process

Nov 15, 2021
Dustin Morrill, Amy R. Greenwald, Michael Bowling

Figure 1 for The Partially Observable History Process
Viaarxiv icon

Learning to Be Cautious

Oct 29, 2021
Montaser Mohammedalamen, Dustin Morrill, Alexander Sieusahai, Yash Satsangi, Michael Bowling

Figure 1 for Learning to Be Cautious
Figure 2 for Learning to Be Cautious
Figure 3 for Learning to Be Cautious
Figure 4 for Learning to Be Cautious
Viaarxiv icon

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games

Feb 13, 2021
Dustin Morrill, Ryan D'Orazio, Marc Lanctot, James R. Wright, Michael Bowling, Amy Greenwald

Figure 1 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 2 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 3 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 4 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Viaarxiv icon

Solving Common-Payoff Games with Approximate Policy Iteration

Jan 11, 2021
Samuel Sokota, Edward Lockhart, Finbarr Timbers, Elnaz Davoodi, Ryan D'Orazio, Neil Burch, Martin Schmid, Michael Bowling, Marc Lanctot

Figure 1 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 2 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 3 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 4 for Solving Common-Payoff Games with Approximate Policy Iteration
Viaarxiv icon

Hindsight and Sequential Rationality of Correlated Play

Dec 17, 2020
Dustin Morrill, Ryan D'Orazio, Reca Sarfati, Marc Lanctot, James R. Wright, Amy Greenwald, Michael Bowling

Figure 1 for Hindsight and Sequential Rationality of Correlated Play
Figure 2 for Hindsight and Sequential Rationality of Correlated Play
Figure 3 for Hindsight and Sequential Rationality of Correlated Play
Figure 4 for Hindsight and Sequential Rationality of Correlated Play
Viaarxiv icon

Useful Policy Invariant Shaping from Arbitrary Advice

Nov 02, 2020
Paniz Behboudian, Yash Satsangi, Matthew E. Taylor, Anna Harutyunyan, Michael Bowling

Figure 1 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 2 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 3 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 4 for Useful Policy Invariant Shaping from Arbitrary Advice
Viaarxiv icon

The Advantage Regret-Matching Actor-Critic

Aug 27, 2020
Audrūnas Gruslys, Marc Lanctot, Rémi Munos, Finbarr Timbers, Martin Schmid, Julien Perolat, Dustin Morrill, Vinicius Zambaldi, Jean-Baptiste Lespiau, John Schultz, Mohammad Gheshlaghi Azar, Michael Bowling, Karl Tuyls

Figure 1 for The Advantage Regret-Matching Actor-Critic
Figure 2 for The Advantage Regret-Matching Actor-Critic
Figure 3 for The Advantage Regret-Matching Actor-Critic
Figure 4 for The Advantage Regret-Matching Actor-Critic
Viaarxiv icon