Joshua Uyheng

Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning

Jul 19, 2018
Julia Kreutzer, Joshua Uyheng, Stefan Riezler

Via arXiv