
Michael Bowling


Marginal Utility for Planning in Continuous or Large Discrete Action Spaces

Jun 17, 2020
Zaheen Farraz Ahmad, Levi H. S. Lelis, Michael Bowling


Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task

Apr 28, 2020
Katya Kudashkina, Valliappa Chockalingam, Graham W. Taylor, Michael Bowling


Approximate exploitability: Learning a best response in large games

Apr 20, 2020
Finbarr Timbers, Edward Lockhart, Martin Schmid, Marc Lanctot, Michael Bowling


Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization

Dec 06, 2019
Ryan D'Orazio, Dustin Morrill, James R. Wright, Michael Bowling


Low-Variance and Zero-Variance Baselines for Extensive-Form Games

Jul 22, 2019
Trevor Davis, Martin Schmid, Michael Bowling


Rethinking Formal Models of Partially Observable Multiagent Decision Making

Jun 26, 2019
Vojtěch Kovařík, Martin Schmid, Neil Burch, Michael Bowling, Viliam Lisý


Ease-of-Teaching and Language Structure from Emergent Communication

Jun 06, 2019
Fushan Li, Michael Bowling


The Hanabi Challenge: A New Frontier for AI Research

Feb 01, 2019
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling


Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

Nov 04, 2018
Jakob N. Foerster, Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew Botvinick, Michael Bowling
