Alert button
Picture for Nikolaus H. R. Howe

Nikolaus H. R. Howe

Alert button

Defining and Characterizing Reward Hacking

Add code
Bookmark button
Alert button
Sep 27, 2022
Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger

Figure 1 for Defining and Characterizing Reward Hacking
Figure 2 for Defining and Characterizing Reward Hacking
Figure 3 for Defining and Characterizing Reward Hacking
Figure 4 for Defining and Characterizing Reward Hacking
Viaarxiv icon

Myriad: a real-world testbed to bridge trajectory optimization and deep learning

Add code
Bookmark button
Alert button
Feb 22, 2022
Nikolaus H. R. Howe, Simon Dufort-Labbé, Nitarshan Rajkumar, Pierre-Luc Bacon

Figure 1 for Myriad: a real-world testbed to bridge trajectory optimization and deep learning
Figure 2 for Myriad: a real-world testbed to bridge trajectory optimization and deep learning
Figure 3 for Myriad: a real-world testbed to bridge trajectory optimization and deep learning
Figure 4 for Myriad: a real-world testbed to bridge trajectory optimization and deep learning
Viaarxiv icon