Dipayan Sen

Adaptive Estimation of Random Vectors with Bandit Feedback

Apr 01, 2022

Dipayan Sen, Prashanth L. A., Aditya Gopalan

Figures 1-4 for Adaptive Estimation of Random Vectors with Bandit Feedback

Abstract: We consider the problem of sequentially learning to estimate, in the mean squared error (MSE) sense, a Gaussian $K$-vector of unknown covariance by observing only $m < K$ of its entries in each round. This reduces to learning an optimal subset of entries from which to estimate the entire vector. To this end, we first establish an exponential concentration bound for an estimate of the MSE of each observable subset. We then frame the estimation problem with bandit feedback in the best-subset identification setting. We propose a variant of the successive elimination algorithm tailored to the adaptive estimation problem, and we derive an upper bound on its sample complexity. In addition, to understand the fundamental limit on the sample complexity of this adaptive estimation bandit problem, we derive a minimax lower bound.
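To illustrate the reduction to choosing an optimal subset, the following is a minimal sketch and not the paper's algorithm. It assumes a zero-mean Gaussian vector whose covariance is known (the paper treats the covariance as unknown and learns from bandit feedback), in which case the minimum MSE from observing a subset $S$ is the trace of the Schur complement $\Sigma_{S^c S^c} - \Sigma_{S^c S}\Sigma_{SS}^{-1}\Sigma_{S S^c}$. The function names `subset_mse` and `best_subset` and the example covariance are hypothetical, chosen only for illustration.

```python
from itertools import combinations

import numpy as np


def subset_mse(cov, subset):
    """MSE of the conditional-mean estimate of the unobserved entries,
    given the entries in `subset`, for a zero-mean Gaussian with a
    KNOWN covariance `cov` (an assumption made for illustration)."""
    K = cov.shape[0]
    s = list(subset)
    rest = [i for i in range(K) if i not in s]
    if not rest:
        return 0.0  # everything is observed, so nothing is left to estimate
    c_ss = cov[np.ix_(s, s)]        # covariance of observed entries
    c_rs = cov[np.ix_(rest, s)]     # cross-covariance, unobserved vs observed
    c_rr = cov[np.ix_(rest, rest)]  # covariance of unobserved entries
    # Error covariance is the Schur complement of c_ss in cov.
    err = c_rr - c_rs @ np.linalg.solve(c_ss, c_rs.T)
    return float(np.trace(err))


def best_subset(cov, m):
    """Exhaustive search over all size-m subsets (oracle baseline)."""
    K = cov.shape[0]
    return min(combinations(range(K), m), key=lambda s: subset_mse(cov, s))


# Hypothetical 3-dimensional example: entries 0 and 1 are strongly
# correlated, entry 2 is independent of both.
cov = np.array([[1.0, 0.9, 0.0],
                [0.9, 1.0, 0.0],
                [0.0, 0.0, 0.5]])

if __name__ == "__main__":
    for m in (1, 2):
        s = best_subset(cov, m)
        print(m, s, subset_mse(cov, s))
```

With a known covariance the search is a plain enumeration. In the setting of the abstract, each subset's MSE would instead have to be estimated from the rounds in which that subset is observed, which is where the concentration bound and the elimination procedure come in.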
