Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Solving Games with Functional Regret Estimation

Dec 31, 2014

Kevin Waugh, Dustin Morrill, J. Andrew Bagnell, Michael Bowling

Figure 1 for Solving Games with Functional Regret Estimation

Figure 2 for Solving Games with Functional Regret Estimation

Figure 3 for Solving Games with Functional Regret Estimation

Share this with someone who'll enjoy it:

Abstract:We propose a novel online learning method for minimizing regret in large extensive-form games. The approach learns a function approximator online to estimate the regret for choosing a particular action. A no-regret algorithm uses these estimates in place of the true regrets to define a sequence of policies. We prove the approach sound by providing a bound relating the quality of the function approximation and regret of the algorithm. A corollary being that the method is guaranteed to converge to a Nash equilibrium in self-play so long as the regrets are ultimately realizable by the function approximator. Our technique can be understood as a principled generalization of existing work on abstraction in large games; in our work, both the abstraction as well as the equilibrium are learned during self-play. We demonstrate empirically the method achieves higher quality strategies than state-of-the-art abstraction techniques given the same resources.

* AAAI Conference on Artificial Intelligence 2015

View paper on

Share this with someone who'll enjoy it:

Title:Solving Games with Functional Regret Estimation

Paper and Code