Alert button

Provably Efficient $Q$-learning with Function Approximation via Distribution Shift Error Checking Oracle

Jun 14, 2019
Simon S. Du, Yuping Luo, Ruosong Wang, Hanrui Zhang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: