Alert button

Scalable agent alignment via reward modeling: a research direction

Add code
Bookmark button
Alert button
Nov 19, 2018
Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg

Figure 1 for Scalable agent alignment via reward modeling: a research direction
Figure 2 for Scalable agent alignment via reward modeling: a research direction
Figure 3 for Scalable agent alignment via reward modeling: a research direction
Figure 4 for Scalable agent alignment via reward modeling: a research direction

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: