Alert button
Picture for Weikai Lu

Weikai Lu

Alert button

Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge

Add code
Bookmark button
Alert button
Apr 08, 2024
Weikai Lu, Ziqian Zeng, Jianwei Wang, Zhengdong Lu, Zelin Chen, Huiping Zhuang, Cen Chen

Viaarxiv icon