Alert button

Debiasing NLP Models Without Demographic Information

Dec 20, 2022
Hadas Orgad, Yonatan Belinkov

Figure 1 for Debiasing NLP Models Without Demographic Information
Figure 2 for Debiasing NLP Models Without Demographic Information
Figure 3 for Debiasing NLP Models Without Demographic Information
Figure 4 for Debiasing NLP Models Without Demographic Information

Share this with someone who'll enjoy it:

Models trained from real-world data tend to imitate and amplify social biases. Although there are many methods suggested to mitigate biases, they require a preliminary information on the types of biases that should be mitigated (e.g., gender or racial bias) and the social groups associated with each data sample. In this work, we propose a debiasing method that operates without any prior knowledge of the demographics in the dataset, detecting biased examples based on an auxiliary model that predicts the main model's success and down-weights them during the training process. Results on racial and gender bias demonstrate that it is possible to mitigate social biases without having to use a costly demographic annotation process.

View paper onarxiv icon

Share this with someone who'll enjoy it: