Jackie CK Cheung

From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards

Mar 21, 2024
Khaoula Chehbouni, Megha Roshan, Emmanuel Ma, Futian Andrew Wei, Afaf Taik, Jackie CK Cheung, Golnoosh Farnadi

Via arXiv

Unsupervised Layer-wise Score Aggregation for Textual OOD Detection

Feb 20, 2023
Maxime Darrin, Guillaume Staerman, Eduardo Dadalto Câmara Gomes, Jackie CK Cheung, Pablo Piantanida, Pierre Colombo

Via arXiv