Bekzhan Kerimkulov

A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

Oct 04, 2023
Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch, Yufei Zhang

Via arXiv

Convergence of policy gradient for entropy regularized MDPs with neural network approximation in the mean-field regime

Jan 18, 2022
Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch

Via arXiv