Picture for Michael Bowling

Michael Bowling

Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

Add code
Nov 04, 2018
Figure 1 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 2 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 3 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 4 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments

Add code
Oct 21, 2018
Figure 1 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 2 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 3 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 4 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Viaarxiv icon

Generalization and Regularization in DQN

Add code
Sep 29, 2018
Figure 1 for Generalization and Regularization in DQN
Figure 2 for Generalization and Regularization in DQN
Figure 3 for Generalization and Regularization in DQN
Figure 4 for Generalization and Regularization in DQN
Viaarxiv icon

Solving Large Extensive-Form Games with Strategy Constraints

Add code
Sep 20, 2018
Figure 1 for Solving Large Extensive-Form Games with Strategy Constraints
Figure 2 for Solving Large Extensive-Form Games with Strategy Constraints
Figure 3 for Solving Large Extensive-Form Games with Strategy Constraints
Viaarxiv icon

Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines

Add code
Sep 09, 2018
Figure 1 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 2 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 3 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 4 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Viaarxiv icon

Count-Based Exploration with the Successor Representation

Add code
Aug 14, 2018
Figure 1 for Count-Based Exploration with the Successor Representation
Figure 2 for Count-Based Exploration with the Successor Representation
Figure 3 for Count-Based Exploration with the Successor Representation
Figure 4 for Count-Based Exploration with the Successor Representation
Viaarxiv icon

The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces

Add code
Jun 08, 2018
Figure 1 for The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
Figure 2 for The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
Viaarxiv icon

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents

Add code
Dec 01, 2017
Figure 1 for Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Figure 2 for Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Figure 3 for Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Figure 4 for Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Viaarxiv icon

A Laplacian Framework for Option Discovery in Reinforcement Learning

Add code
Jun 16, 2017
Figure 1 for A Laplacian Framework for Option Discovery in Reinforcement Learning
Figure 2 for A Laplacian Framework for Option Discovery in Reinforcement Learning
Figure 3 for A Laplacian Framework for Option Discovery in Reinforcement Learning
Figure 4 for A Laplacian Framework for Option Discovery in Reinforcement Learning
Viaarxiv icon

DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker

Add code
Mar 03, 2017
Viaarxiv icon