Alert button
Picture for Juan Rocamonde

Juan Rocamonde

Alert button

Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 19, 2023
Juan Rocamonde, Victoriano Montesinos, Elvis Nava, Ethan Perez, David Lindner

Viaarxiv icon

imitation: Clean Imitation Learning Implementations

Add code
Bookmark button
Alert button
Nov 22, 2022
Adam Gleave, Mohammad Taufeeque, Juan Rocamonde, Erik Jenner, Steven H. Wang, Sam Toyer, Maximilian Ernestus, Nora Belrose, Scott Emmons, Stuart Russell

Figure 1 for imitation: Clean Imitation Learning Implementations
Figure 2 for imitation: Clean Imitation Learning Implementations
Figure 3 for imitation: Clean Imitation Learning Implementations
Figure 4 for imitation: Clean Imitation Learning Implementations
Viaarxiv icon