Title: Regret Bounds for Stochastic Combinatorial Multi-Armed Bandits with Linear Space Complexity

Nov 29, 2018

Mridul Agarwal, Vaneet Aggarwal

Abstract: Many real-world problems involve choosing the best $K$ out of $N$ options at a given time instant. This setup can be modelled as a combinatorial bandit that chooses $K$ out of $N$ arms at each time step, with the aim of achieving an efficient tradeoff between exploration and exploitation. This is the first work on combinatorial bandits in which the reward received can be a non-linear function of the chosen $K$ arms. Applying a standard multi-armed bandit directly requires choosing among $\binom{N}{K}$ options, which makes the state space large. In this paper, we present a novel algorithm that is computationally efficient and whose storage is linear in $N$. The proposed algorithm, which we call CMAB-SM, is a divide-and-conquer strategy. It achieves a regret bound of $\tilde{O}(K^{\frac{1}{2}} N^{\frac{1}{3}} T^{\frac{2}{3}})$ for a time horizon $T$, which is sub-linear in all three parameters $T$, $N$, and $K$. Evaluation on different reward functions and arm distribution functions shows significantly improved performance compared to the standard multi-armed bandit approach with $\binom{N}{K}$ choices.
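To make the scale of the two quantities in the abstract concrete, the following is a minimal Python sketch. It does not implement CMAB-SM; it only counts the $\binom{N}{K}$ options a standard bandit would have to track and evaluates the growth term $K^{1/2} N^{1/3} T^{2/3}$ with constants and logarithmic factors dropped. The function names and the example values of $N$, $K$, and $T$ are illustrative assumptions, not taken from the paper.

```python
from math import comb


def naive_arm_count(n: int, k: int) -> int:
    """Number of K-subsets of N arms, i.e. the options a standard
    multi-armed bandit would treat as separate arms."""
    return comb(n, k)


def regret_bound_scale(n: int, k: int, t: int) -> float:
    """Growth term K^(1/2) * N^(1/3) * T^(2/3) from the stated bound.

    Assumption: constants and log factors hidden by the O-tilde
    notation are ignored, so only relative growth is meaningful.
    """
    return (k ** 0.5) * (n ** (1 / 3)) * (t ** (2 / 3))


if __name__ == "__main__":
    n, k = 50, 5  # hypothetical problem size
    print("options for a standard bandit:", naive_arm_count(n, k))
    print("storage linear in N would scale with:", n)
    for t in (1_000, 8_000, 64_000):
        scale = regret_bound_scale(n, k, t)
        # Per-round value shrinks as T grows because the bound is sub-linear in T.
        print(t, round(scale, 1), round(scale / t, 4))
```

With these example values, the count of subsets is already in the millions while $N$ is 50, and the per-round value of the growth term decreases as $T$ increases, which is what sub-linearity in $T$ means in practice.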

* 32 pages
