Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning

Jul 19, 2018
Julia Kreutzer, Joshua Uyheng, Stefan Riezler

View paper on arXiv
