Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hany Hassan Awadallah

Leveraging GPT-4 for Automatic Translation Post-Editing

May 24, 2023

Vikas Raunak, Amr Sharaf, Hany Hassan Awadallah, Arul Menezes

Figure 1 for Leveraging GPT-4 for Automatic Translation Post-Editing

Figure 2 for Leveraging GPT-4 for Automatic Translation Post-Editing

Figure 3 for Leveraging GPT-4 for Automatic Translation Post-Editing

Figure 4 for Leveraging GPT-4 for Automatic Translation Post-Editing

Abstract:While Neural Machine Translation (NMT) represents the leading approach to Machine Translation (MT), the outputs of NMT models still require translation post-editing to rectify errors and enhance quality, particularly under critical settings. In this work, we formalize the task of translation post-editing with Large Language Models (LLMs) and explore the use of GPT-4 to automatically post-edit NMT outputs across several language pairs. Our results demonstrate that GPT-4 is adept at translation post-editing and produces meaningful edits even when the target language is not English. Notably, we achieve state-of-the-art performance on WMT-22 English-Chinese, English-German, Chinese-English and German-English language pairs using GPT-4 based post-editing, as evaluated by state-of-the-art MT quality metrics.

Via

Access Paper or Ask Questions

Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU

Aug 14, 2022

Hossam Amer, Young Jin Kim, Mohamed Afify, Hitokazu Matsushita, Hany Hassan Awadallah

Figure 1 for Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU

Figure 2 for Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU

Figure 3 for Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU

Figure 4 for Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU

Abstract:Multilingual Neural Machine Translation has been showing great success using transformer models. Deploying these models is challenging because they usually require large vocabulary (vocab) sizes for various languages. This limits the speed of predicting the output tokens in the last vocab projection layer. To alleviate these challenges, this paper proposes a fast vocabulary projection method via clustering which can be used for multilingual transformers on GPUs. First, we offline split the vocab search space into disjoint clusters given the hidden context vector of the decoder output, which results in much smaller vocab columns for vocab projection. Second, at inference time, the proposed method predicts the clusters and candidate active tokens for hidden context vectors at the vocab projection. This paper also includes analysis of different ways of building these clusters in multilingual settings. Our results show end-to-end speed gains in float16 GPU inference up to 25% while maintaining the BLEU score and slightly increasing memory cost. The proposed method speeds up the vocab projection step itself by up to 2.6x. We also conduct an extensive human evaluation to verify the proposed method preserves the quality of the translations from the original model.

* 12 pages, accepted at AMTA-2022 (Association for Machine Translation in the Americas Conference)

Via

Access Paper or Ask Questions