Alert button

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

Add code
Bookmark button
Alert button
Sep 14, 2023
Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, James Zou

Figure 1 for Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
Figure 2 for Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
Figure 3 for Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
Figure 4 for Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: