Alert button

Removing RLHF Protections in GPT-4 via Fine-Tuning

Nov 09, 2023
Qiusi Zhan, Richard Fang, Rohan Bindu, Akul Gupta, Tatsunori Hashimoto, Daniel Kang

Figure 1 for Removing RLHF Protections in GPT-4 via Fine-Tuning
Figure 2 for Removing RLHF Protections in GPT-4 via Fine-Tuning

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: