Picture for Min-Hwan Oh

Min-Hwan Oh

Variance-Adaptive Optimal Algorithm for Reinforcement Learning with Multinomial Logit Function Approximation

Add code
May 27, 2026
Viaarxiv icon