Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Balanced Linear Contextual Bandits

Dec 15, 2018

Maria Dimakopoulou, Zhengyuan Zhou, Susan Athey, Guido Imbens

Figure 1 for Balanced Linear Contextual Bandits

Figure 2 for Balanced Linear Contextual Bandits

Figure 3 for Balanced Linear Contextual Bandits

Figure 4 for Balanced Linear Contextual Bandits

Share this with someone who'll enjoy it:

Abstract:Contextual bandit algorithms are sensitive to the estimation method of the outcome model as well as the exploration method used, particularly in the presence of rich heterogeneity or complex outcome models, which can lead to difficult estimation problems along the path of learning. We develop algorithms for contextual bandits with linear payoffs that integrate balancing methods from the causal inference literature in their estimation to make it less prone to problems of estimation bias. We provide the first regret bound analyses for linear contextual bandits with balancing and show that our algorithms match the state of the art theoretical guarantees. We demonstrate the strong practical advantage of balanced contextual bandits on a large number of supervised learning datasets and on a synthetic example that simulates model misspecification and prejudice in the initial training data.

* AAAI 2019 Oral Presentation. arXiv admin note: substantial text overlap with arXiv:1711.07077

View paper on

Share this with someone who'll enjoy it:

Title:Balanced Linear Contextual Bandits

Paper and Code