Alert button

Reward learning from human preferences and demonstrations in Atari

Nov 15, 2018
Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei

Figure 1 for Reward learning from human preferences and demonstrations in Atari
Figure 2 for Reward learning from human preferences and demonstrations in Atari
Figure 3 for Reward learning from human preferences and demonstrations in Atari
Figure 4 for Reward learning from human preferences and demonstrations in Atari

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: