Weak-to-Strong Jailbreaking on Large Language Models

Jan 30, 2024
Xuandong Zhao, Xianjun Yang, Tianyu Pang, Chao Du, Lei Li, Yu-Xiang Wang, William Yang Wang
