Alert button
Picture for Sam Toyer

Sam Toyer

Alert button

A StrongREJECT for Empty Jailbreaks

Add code
Bookmark button
Alert button
Feb 15, 2024
Alexandra Souly, Qingyuan Lu, Dillon Bowen, Tu Trinh, Elvis Hsieh, Sana Pandey, Pieter Abbeel, Justin Svegliato, Scott Emmons, Olivia Watkins, Sam Toyer

Viaarxiv icon

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

Add code
Bookmark button
Alert button
Nov 02, 2023
Sam Toyer, Olivia Watkins, Ethan Adrian Mendes, Justin Svegliato, Luke Bailey, Tiffany Wang, Isaac Ong, Karim Elmaaroufi, Pieter Abbeel, Trevor Darrell, Alan Ritter, Stuart Russell

Viaarxiv icon

imitation: Clean Imitation Learning Implementations

Add code
Bookmark button
Alert button
Nov 22, 2022
Adam Gleave, Mohammad Taufeeque, Juan Rocamonde, Erik Jenner, Steven H. Wang, Sam Toyer, Maximilian Ernestus, Nora Belrose, Scott Emmons, Stuart Russell

Figure 1 for imitation: Clean Imitation Learning Implementations
Figure 2 for imitation: Clean Imitation Learning Implementations
Figure 3 for imitation: Clean Imitation Learning Implementations
Figure 4 for imitation: Clean Imitation Learning Implementations
Viaarxiv icon

An Empirical Investigation of Representation Learning for Imitation

Add code
Bookmark button
Alert button
May 16, 2022
Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah

Figure 1 for An Empirical Investigation of Representation Learning for Imitation
Figure 2 for An Empirical Investigation of Representation Learning for Imitation
Figure 3 for An Empirical Investigation of Representation Learning for Imitation
Figure 4 for An Empirical Investigation of Representation Learning for Imitation
Viaarxiv icon

A Primer on Maximum Causal Entropy Inverse Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 22, 2022
Adam Gleave, Sam Toyer

Figure 1 for A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
Viaarxiv icon

DERAIL: Diagnostic Environments for Reward And Imitation Learning

Add code
Bookmark button
Alert button
Dec 02, 2020
Pedro Freire, Adam Gleave, Sam Toyer, Stuart Russell

Figure 1 for DERAIL: Diagnostic Environments for Reward And Imitation Learning
Figure 2 for DERAIL: Diagnostic Environments for Reward And Imitation Learning
Figure 3 for DERAIL: Diagnostic Environments for Reward And Imitation Learning
Figure 4 for DERAIL: Diagnostic Environments for Reward And Imitation Learning
Viaarxiv icon

The MAGICAL Benchmark for Robust Imitation

Add code
Bookmark button
Alert button
Nov 01, 2020
Sam Toyer, Rohin Shah, Andrew Critch, Stuart Russell

Figure 1 for The MAGICAL Benchmark for Robust Imitation
Figure 2 for The MAGICAL Benchmark for Robust Imitation
Figure 3 for The MAGICAL Benchmark for Robust Imitation
Figure 4 for The MAGICAL Benchmark for Robust Imitation
Viaarxiv icon

ASNets: Deep Learning for Generalised Planning

Add code
Bookmark button
Alert button
Aug 04, 2019
Sam Toyer, Felipe Trevizan, Sylvie Thiébaux, Lexing Xie

Figure 1 for ASNets: Deep Learning for Generalised Planning
Figure 2 for ASNets: Deep Learning for Generalised Planning
Figure 3 for ASNets: Deep Learning for Generalised Planning
Figure 4 for ASNets: Deep Learning for Generalised Planning
Viaarxiv icon

Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow

Add code
Bookmark button
Alert button
Oct 01, 2018
Xue Bin Peng, Angjoo Kanazawa, Sam Toyer, Pieter Abbeel, Sergey Levine

Figure 1 for Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Figure 2 for Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Figure 3 for Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Figure 4 for Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Viaarxiv icon

Action Schema Networks: Generalised Policies with Deep Learning

Add code
Bookmark button
Alert button
Dec 22, 2017
Sam Toyer, Felipe Trevizan, Sylvie Thiébaux, Lexing Xie

Figure 1 for Action Schema Networks: Generalised Policies with Deep Learning
Figure 2 for Action Schema Networks: Generalised Policies with Deep Learning
Figure 3 for Action Schema Networks: Generalised Policies with Deep Learning
Viaarxiv icon