Alert button

Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards

Apr 26, 2023
Amaury Gouverneur, Borja Rodríguez-Gálvez, Tobias J. Oechtering, Mikael Skoglund

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: