Alert button
Picture for Jianliang He

Jianliang He

Alert button

Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

Add code
Bookmark button
Alert button
Apr 19, 2024
Jianliang He, Han Zhong, Zhuoran Yang

Viaarxiv icon