Charlie Rogers-Smith

BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B

Oct 31, 2023
Pranav Gade, Simon Lermen, Charlie Rogers-Smith, Jeffrey Ladish

Via arXiv

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Oct 31, 2023
Simon Lermen, Charlie Rogers-Smith, Jeffrey Ladish

Via arXiv

Approximate Bayesian Computation via Population Monte Carlo and Classification

Oct 29, 2018
Charlie Rogers-Smith, Henri Pesonen, Samuel Kaski

Via arXiv