Alert button
Picture for Michael Bowling

Michael Bowling

Alert button

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments

Add code
Bookmark button
Alert button
Oct 21, 2018
Sriram Srinivasan, Marc Lanctot, Vinicius Zambaldi, Julien Perolat, Karl Tuyls, Remi Munos, Michael Bowling

Figure 1 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 2 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 3 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 4 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Viaarxiv icon

Generalization and Regularization in DQN

Add code
Bookmark button
Alert button
Sep 29, 2018
Jesse Farebrother, Marlos C. Machado, Michael Bowling

Figure 1 for Generalization and Regularization in DQN
Figure 2 for Generalization and Regularization in DQN
Figure 3 for Generalization and Regularization in DQN
Figure 4 for Generalization and Regularization in DQN
Viaarxiv icon

Solving Large Extensive-Form Games with Strategy Constraints

Add code
Bookmark button
Alert button
Sep 20, 2018
Trevor Davis, Kevin Waugh, Michael Bowling

Figure 1 for Solving Large Extensive-Form Games with Strategy Constraints
Figure 2 for Solving Large Extensive-Form Games with Strategy Constraints
Figure 3 for Solving Large Extensive-Form Games with Strategy Constraints
Viaarxiv icon

Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines

Add code
Bookmark button
Alert button
Sep 09, 2018
Martin Schmid, Neil Burch, Marc Lanctot, Matej Moravcik, Rudolf Kadlec, Michael Bowling

Figure 1 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 2 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 3 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 4 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Viaarxiv icon

Count-Based Exploration with the Successor Representation

Add code
Bookmark button
Alert button
Aug 14, 2018
Marlos C. Machado, Marc G. Bellemare, Michael Bowling

Figure 1 for Count-Based Exploration with the Successor Representation
Figure 2 for Count-Based Exploration with the Successor Representation
Figure 3 for Count-Based Exploration with the Successor Representation
Figure 4 for Count-Based Exploration with the Successor Representation
Viaarxiv icon

The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces

Add code
Bookmark button
Alert button
Jun 08, 2018
G. Zacharias Holland, Erik Talvitie, Michael Bowling

Figure 1 for The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
Figure 2 for The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
Viaarxiv icon

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents

Add code
Bookmark button
Alert button
Dec 01, 2017
Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew Hausknecht, Michael Bowling

Figure 1 for Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Figure 2 for Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Figure 3 for Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Figure 4 for Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Viaarxiv icon

A Laplacian Framework for Option Discovery in Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 16, 2017
Marlos C. Machado, Marc G. Bellemare, Michael Bowling

Figure 1 for A Laplacian Framework for Option Discovery in Reinforcement Learning
Figure 2 for A Laplacian Framework for Option Discovery in Reinforcement Learning
Figure 3 for A Laplacian Framework for Option Discovery in Reinforcement Learning
Figure 4 for A Laplacian Framework for Option Discovery in Reinforcement Learning
Viaarxiv icon

DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker

Add code
Bookmark button
Alert button
Mar 03, 2017
Matej Moravčík, Martin Schmid, Neil Burch, Viliam Lisý, Dustin Morrill, Nolan Bard, Trevor Davis, Kevin Waugh, Michael Johanson, Michael Bowling

Viaarxiv icon

AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games

Add code
Bookmark button
Alert button
Jan 19, 2017
Neil Burch, Martin Schmid, Matej Moravčík, Michael Bowling

Figure 1 for AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games
Figure 2 for AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games
Figure 3 for AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games
Figure 4 for AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games
Viaarxiv icon