What's in Your "Safe" Data?: Identifying Benign Data that Breaks Safety

Apr 01, 2024


