Alert button

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP

Jan 29, 2021
Zihan Zhang, Jiaqi Yang, Xiangyang Ji, Simon S. Du

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: