Adaptive Monte Carlo via Bandit Allocation

May 13, 2014
James Neufeld, András György, Dale Schuurmans, Csaba Szepesvári



We consider the problem of sequentially choosing among a set of unbiased Monte Carlo estimators to minimize the mean squared error (MSE) of a final combined estimate. By reducing this task to a stochastic multi-armed bandit problem, we show that well-developed allocation strategies can be used to achieve an MSE that approaches that of the best estimator chosen in retrospect. We then extend these developments to a scenario where alternative estimators have different, possibly stochastic, costs. The outcome is a new set of adaptive Monte Carlo strategies that provide stronger guarantees than previous approaches while offering practical advantages.

* The 31st International Conference on Machine Learning (ICML 2014) 
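
The bandit reduction described in the abstract can be illustrated with a minimal sketch. It is an illustration under stated assumptions, not the authors' implementation. It assumes every estimator is unbiased for the same quantity and returns samples in [0, bound], so the estimator with the smallest second moment also has the smallest variance. It also assumes a standard UCB1-style confidence width, and that the final estimate is the plain average of all samples drawn. The names `adaptive_mc`, `low_var`, and `high_var` are hypothetical.

```python
import math
import random


def adaptive_mc(estimators, budget, bound=1.0, seed=0):
    """Allocate `budget` samples among unbiased estimators with a UCB-style rule.

    Assumption: each estimator is a callable rng -> float in [0, bound],
    and all estimators share the same mean.
    Returns (combined estimate, number of samples drawn per estimator).
    """
    rng = random.Random(seed)
    k = len(estimators)
    counts = [0] * k        # samples drawn from each estimator
    sq_sums = [0.0] * k     # running sum of squared samples per estimator
    total = 0.0             # running sum of all samples

    for t in range(1, budget + 1):
        if t <= k:
            arm = t - 1     # draw once from every estimator first
        else:
            # Optimistic (lower) bound on each estimator's second moment.
            # Minimizing the second moment is treated as the bandit "loss".
            def index(i):
                mean_sq = sq_sums[i] / counts[i]
                width = bound ** 2 * math.sqrt(2.0 * math.log(t) / counts[i])
                return mean_sq - width

            arm = min(range(k), key=index)

        x = estimators[arm](rng)
        counts[arm] += 1
        sq_sums[arm] += x * x
        total += x

    # All estimators are unbiased, so the overall average is unbiased too.
    return total / budget, counts


# Two hypothetical estimators of the same mean (0.5) with different variance.
def low_var(rng):
    return rng.uniform(0.4, 0.6)


def high_var(rng):
    return 1.0 if rng.random() < 0.5 else 0.0


estimate, counts = adaptive_mc([low_var, high_var], budget=5000)
```

In this toy setting the allocation rule should concentrate most of the budget on `low_var`, so the combined estimate behaves almost as if the lower-variance estimator had been chosen in advance.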

