Alert button

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Oct 31, 2023
Simon Lermen, Charlie Rogers-Smith, Jeffrey Ladish

Figure 1 for LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Figure 2 for LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Figure 3 for LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Figure 4 for LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: