Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Probing Explicit and Implicit Gender Bias through LLM Conditional Text Generation

Nov 01, 2023

Xiangjue Dong, Yibo Wang, Philip S. Yu, James Caverlee

Figure 1 for Probing Explicit and Implicit Gender Bias through LLM Conditional Text Generation

Figure 2 for Probing Explicit and Implicit Gender Bias through LLM Conditional Text Generation

Figure 3 for Probing Explicit and Implicit Gender Bias through LLM Conditional Text Generation

Figure 4 for Probing Explicit and Implicit Gender Bias through LLM Conditional Text Generation

Share this with someone who'll enjoy it:

Abstract:Large Language Models (LLMs) can generate biased and toxic responses. Yet most prior work on LLM gender bias evaluation requires predefined gender-related phrases or gender stereotypes, which are challenging to be comprehensively collected and are limited to explicit bias evaluation. In addition, we believe that instances devoid of gender-related language or explicit stereotypes in inputs can still induce gender bias in LLMs. Thus, in this work, we propose a conditional text generation mechanism without the need for predefined gender phrases and stereotypes. This approach employs three types of inputs generated through three distinct strategies to probe LLMs, aiming to show evidence of explicit and implicit gender biases in LLMs. We also utilize explicit and implicit evaluation metrics to evaluate gender bias in LLMs under different strategies. Our experiments demonstrate that an increased model size does not consistently lead to enhanced fairness and all tested LLMs exhibit explicit and/or implicit gender bias, even when explicit gender stereotypes are absent in the inputs.

* Accepted in Socially Responsible Language Modelling Research (SoLaR) 2023 at NeurIPS 2023; the first two authors contribute equally

View paper on

Share this with someone who'll enjoy it:

Title:Probing Explicit and Implicit Gender Bias through LLM Conditional Text Generation

Paper and Code