Alert button

Learning and Forgetting Unsafe Examples in Large Language Models

Dec 20, 2023
Jiachen Zhao, Zhun Deng, David Madras, James Zou, Mengye Ren

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: