Olivia Watkins

A StrongREJECT for Empty Jailbreaks

Feb 15, 2024
Alexandra Souly, Qingyuan Lu, Dillon Bowen, Tu Trinh, Elvis Hsieh, Sana Pandey, Pieter Abbeel, Justin Svegliato, Scott Emmons, Olivia Watkins, Sam Toyer

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

Nov 02, 2023
Sam Toyer, Olivia Watkins, Ethan Adrian Mendes, Justin Svegliato, Luke Bailey, Tiffany Wang, Isaac Ong, Karim Elmaaroufi, Pieter Abbeel, Trevor Darrell, Alan Ritter, Stuart Russell

Learning to Model the World with Language

Jul 31, 2023
Jessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca Dragan

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

May 25, 2023
Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee

Aligning Text-to-Image Models using Human Feedback

Feb 23, 2023
Kimin Lee, Hao Liu, Moonkyung Ryu, Olivia Watkins, Yuqing Du, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Shixiang Shane Gu

Guiding Pretraining in Reinforcement Learning with Large Language Models

Feb 13, 2023
Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas

Teachable Reinforcement Learning via Advice Distillation

Mar 19, 2022
Olivia Watkins, Trevor Darrell, Pieter Abbeel, Jacob Andreas, Abhishek Gupta

Explaining Reinforcement Learning Policies through Counterfactual Trajectories

Jan 29, 2022
Julius Frost, Olivia Watkins, Eric Weiner, Pieter Abbeel, Trevor Darrell, Bryan Plummer, Kate Saenko

Auto-Tuned Sim-to-Real Transfer

Apr 15, 2021
Yuqing Du, Olivia Watkins, Trevor Darrell, Pieter Abbeel, Deepak Pathak

Hierarchical Text Generation using an Outline

Oct 20, 2018
Mehdi Drissi, Olivia Watkins, Jugal Kalita
