AttackEval: How to Evaluate the Effectiveness of Jailbreak Attacking on Large Language Models

Jan 17, 2024
Dong Shu, Mingyu Jin, Suiyuan Zhu, Beichen Wang, Zihao Zhou, Chong Zhang, Yongfeng Zhang
