Picture for Michael Bowling

Michael Bowling

Useful Policy Invariant Shaping from Arbitrary Advice

Add code
Nov 02, 2020
Figure 1 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 2 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 3 for Useful Policy Invariant Shaping from Arbitrary Advice
Figure 4 for Useful Policy Invariant Shaping from Arbitrary Advice
Viaarxiv icon

The Advantage Regret-Matching Actor-Critic

Add code
Aug 27, 2020
Figure 1 for The Advantage Regret-Matching Actor-Critic
Figure 2 for The Advantage Regret-Matching Actor-Critic
Figure 3 for The Advantage Regret-Matching Actor-Critic
Figure 4 for The Advantage Regret-Matching Actor-Critic
Viaarxiv icon

Marginal Utility for Planning in Continuous or Large Discrete Action Spaces

Add code
Jun 17, 2020
Figure 1 for Marginal Utility for Planning in Continuous or Large Discrete Action Spaces
Figure 2 for Marginal Utility for Planning in Continuous or Large Discrete Action Spaces
Figure 3 for Marginal Utility for Planning in Continuous or Large Discrete Action Spaces
Figure 4 for Marginal Utility for Planning in Continuous or Large Discrete Action Spaces
Viaarxiv icon

Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task

Add code
Apr 28, 2020
Figure 1 for Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task
Figure 2 for Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task
Figure 3 for Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task
Figure 4 for Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task
Viaarxiv icon

Approximate exploitability: Learning a best response in large games

Add code
Apr 20, 2020
Figure 1 for Approximate exploitability: Learning a best response in large games
Figure 2 for Approximate exploitability: Learning a best response in large games
Figure 3 for Approximate exploitability: Learning a best response in large games
Figure 4 for Approximate exploitability: Learning a best response in large games
Viaarxiv icon

Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization

Add code
Dec 06, 2019
Figure 1 for Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization
Figure 2 for Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization
Figure 3 for Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization
Viaarxiv icon

Low-Variance and Zero-Variance Baselines for Extensive-Form Games

Add code
Jul 22, 2019
Figure 1 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Figure 2 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Figure 3 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Viaarxiv icon

Rethinking Formal Models of Partially Observable Multiagent Decision Making

Add code
Jun 26, 2019
Figure 1 for Rethinking Formal Models of Partially Observable Multiagent Decision Making
Figure 2 for Rethinking Formal Models of Partially Observable Multiagent Decision Making
Figure 3 for Rethinking Formal Models of Partially Observable Multiagent Decision Making
Figure 4 for Rethinking Formal Models of Partially Observable Multiagent Decision Making
Viaarxiv icon

Ease-of-Teaching and Language Structure from Emergent Communication

Add code
Jun 06, 2019
Figure 1 for Ease-of-Teaching and Language Structure from Emergent Communication
Figure 2 for Ease-of-Teaching and Language Structure from Emergent Communication
Figure 3 for Ease-of-Teaching and Language Structure from Emergent Communication
Figure 4 for Ease-of-Teaching and Language Structure from Emergent Communication
Viaarxiv icon

The Hanabi Challenge: A New Frontier for AI Research

Add code
Feb 01, 2019
Figure 1 for The Hanabi Challenge: A New Frontier for AI Research
Figure 2 for The Hanabi Challenge: A New Frontier for AI Research
Figure 3 for The Hanabi Challenge: A New Frontier for AI Research
Figure 4 for The Hanabi Challenge: A New Frontier for AI Research
Viaarxiv icon