Picture for Svetlana Kiritchenko

Svetlana Kiritchenko

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models

Add code
Jun 08, 2022
Figure 1 for Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models
Figure 2 for Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models
Viaarxiv icon

Does Moral Code Have a Moral Code? Probing Delphi's Moral Philosophy

Add code
May 25, 2022
Figure 1 for Does Moral Code Have a Moral Code? Probing Delphi's Moral Philosophy
Figure 2 for Does Moral Code Have a Moral Code? Probing Delphi's Moral Philosophy
Figure 3 for Does Moral Code Have a Moral Code? Probing Delphi's Moral Philosophy
Figure 4 for Does Moral Code Have a Moral Code? Probing Delphi's Moral Philosophy
Viaarxiv icon

Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection

Add code
May 06, 2022
Figure 1 for Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Figure 2 for Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Figure 3 for Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Figure 4 for Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Viaarxiv icon

Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors

Add code
Apr 05, 2022
Figure 1 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 2 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 3 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 4 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Viaarxiv icon

Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model

Add code
Jun 04, 2021
Figure 1 for Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model
Figure 2 for Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model
Figure 3 for Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model
Figure 4 for Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model
Viaarxiv icon

Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective

Add code
Dec 22, 2020
Figure 1 for Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective
Figure 2 for Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective
Figure 3 for Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective
Figure 4 for Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective
Viaarxiv icon

On Cross-Dataset Generalization in Automatic Detection of Online Abuse

Add code
Nov 03, 2020
Figure 1 for On Cross-Dataset Generalization in Automatic Detection of Online Abuse
Figure 2 for On Cross-Dataset Generalization in Automatic Detection of Online Abuse
Figure 3 for On Cross-Dataset Generalization in Automatic Detection of Online Abuse
Figure 4 for On Cross-Dataset Generalization in Automatic Detection of Online Abuse
Viaarxiv icon

Towards Ethics by Design in Online Abusive Content Detection

Add code
Oct 28, 2020
Figure 1 for Towards Ethics by Design in Online Abusive Content Detection
Figure 2 for Towards Ethics by Design in Online Abusive Content Detection
Figure 3 for Towards Ethics by Design in Online Abusive Content Detection
Viaarxiv icon

SOLO: A Corpus of Tweets for Examining the State of Being Alone

Add code
Jun 04, 2020
Figure 1 for SOLO: A Corpus of Tweets for Examining the State of Being Alone
Figure 2 for SOLO: A Corpus of Tweets for Examining the State of Being Alone
Figure 3 for SOLO: A Corpus of Tweets for Examining the State of Being Alone
Figure 4 for SOLO: A Corpus of Tweets for Examining the State of Being Alone
Viaarxiv icon