Picture for Chon Wai Ho

Chon Wai Ho

Bayesian learning of the optimal action-value function in a Markov decision process

Add code
May 03, 2025
Viaarxiv icon