Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Contextualized Code Representation Learning for Commit Message Generation

Jul 14, 2020

Lun Yiu Nie, Cuiyun Gao, Zhicong Zhong, Wai Lam, Yang Liu, Zenglin Xu

Figure 1 for Contextualized Code Representation Learning for Commit Message Generation

Figure 2 for Contextualized Code Representation Learning for Commit Message Generation

Figure 3 for Contextualized Code Representation Learning for Commit Message Generation

Figure 4 for Contextualized Code Representation Learning for Commit Message Generation

Share this with someone who'll enjoy it:

Abstract:Automatic generation of high-quality commit messages for code commits can substantially facilitate developers' works and coordination. However, the semantic gap between source code and natural language poses a major challenge for the task. Several studies have been proposed to alleviate the challenge but none explicitly involves code contextual information during commit message generation. Specifically, existing research adopts static embedding for code tokens, which maps a token to the same vector regardless of its context. In this paper, we propose a novel Contextualized code representation learning method for commit message Generation (CoreGen). CoreGen first learns contextualized code representation which exploits the contextual information behind code commit sequences. The learned representations of code commits built upon Transformer are then transferred for downstream commit message generation. Experiments on the benchmark dataset demonstrate the superior effectiveness of our model over the baseline models with an improvement of 28.18% in terms of BLEU-4 score. Furthermore, we also highlight the future opportunities in training contextualized code representations on larger code corpus as a solution to low-resource settings and adapting the pretrained code representations to other downstream code-to-text generation tasks.

View paper on

Share this with someone who'll enjoy it:

Title:Contextualized Code Representation Learning for Commit Message Generation

Paper and Code