Title: A Simple, Yet Effective Approach to Finding Biases in Code Generation

Oct 31, 2022

Spyridon Mouselinos, Mateusz Malinowski, Henryk Michalewski

Abstract: Recently, scores of high-performing code generation systems have surfaced. As has become common in many domains, code generation is often approached with a large language model at its core, trained under the masked or causal language modeling schema. This work shows that current code generation systems exhibit biases inherited from their large language model backbones, which might leak into generated code under specific circumstances. To investigate the effect, we propose a framework that automatically removes hints and exposes various biases that these code generation models use. We apply our framework to three coding challenges and test it across top-performing code generation models. Our experiments reveal biases towards specific prompt structure and exploitation of keywords during code generation. Finally, we demonstrate how to use our framework as a data transformation technique, which we find a promising direction toward more robust code generation.
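To make the idea of "removing hints" concrete, here is a minimal Python sketch of two prompt transformations of the kind the abstract alludes to: replacing a descriptive function name with a neutral one, and deleting chosen keywords from the prompt. This is an illustration only, not the authors' framework. The helper names `anonymize_function_name` and `drop_keywords`, the placeholder `func`, and the regex-based matching are all assumptions introduced here.

```python
import re
from typing import Iterable


def anonymize_function_name(prompt: str, placeholder: str = "func") -> str:
    """Replace the first defined function's name with a neutral placeholder.

    Hypothetical helper: assumes the prompt contains a Python `def name(` line.
    """
    match = re.search(r"def\s+([A-Za-z_]\w*)\s*\(", prompt)
    if match is None:
        return prompt  # nothing to anonymize
    name = match.group(1)
    # Replace every whole-word occurrence, including mentions in the docstring.
    return re.sub(rf"\b{re.escape(name)}\b", placeholder, prompt)


def drop_keywords(prompt: str, keywords: Iterable[str]) -> str:
    """Delete each keyword (whole word, case-insensitive) from the prompt.

    Hypothetical helper: the keyword list is supplied by the caller.
    """
    for kw in keywords:
        prompt = re.sub(rf"\b{re.escape(kw)}\b", "", prompt, flags=re.IGNORECASE)
    # Collapse double spaces left inside a line; indentation is untouched.
    return re.sub(r"(?<=\S) {2,}", " ", prompt)


if __name__ == "__main__":
    original = 'def sort_list(xs):\n    """Return the sorted list xs."""\n'
    without_name = anonymize_function_name(original)
    without_hints = drop_keywords(without_name, ["sorted"])
    print(without_hints)
```

Under these assumptions, one would feed both the original and the transformed prompt to a code generation model and compare the pass rates of the generated solutions; a large drop would suggest the model was leaning on the removed hint.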

* Preprint

View paper on OpenReview
