Universal and Transferable Adversarial Attacks on Aligned Language Models

Jul 27, 2023
Andy Zou, Zifan Wang, J. Zico Kolter, Matt Fredrikson


View paper on arXiv
