Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings

Jun 30, 2020
Xiao-Yue Gong, David Simchi-Levi

Figures 1-4 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings

View paper on arXiv