Picture for Gary Lopez

Gary Lopez

Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

Add code
Jul 18, 2024
Figure 1 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 2 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 3 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 4 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Viaarxiv icon

Defending Against Indirect Prompt Injection Attacks With Spotlighting

Add code
Mar 20, 2024
Viaarxiv icon