A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses

Add code
Jul 02, 2024
Figure 1 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 2 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 3 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 4 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: