Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation

May 11, 2020

Aditya Siddhant, Ankur Bapna, Yuan Cao, Orhan Firat, Mia Chen, Sneha Kudugunta, Naveen Arivazhagan, Yonghui Wu

Figure 1 for Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation

Figure 2 for Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation

Figure 3 for Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation

Figure 4 for Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation

Share this with someone who'll enjoy it:

Abstract:Over the last few years two promising research directions in low-resource neural machine translation (NMT) have emerged. The first focuses on utilizing high-resource languages to improve the quality of low-resource languages via multilingual NMT. The second direction employs monolingual data with self-supervision to pre-train translation models, followed by fine-tuning on small amounts of supervised data. In this work, we join these two lines of research and demonstrate the efficacy of monolingual data with self-supervision in multilingual NMT. We offer three major results: (i) Using monolingual data significantly boosts the translation quality of low-resource languages in multilingual models. (ii) Self-supervision improves zero-shot translation quality in multilingual models. (iii) Leveraging monolingual data with self-supervision provides a viable path towards adding new languages to multilingual models, getting up to 33 BLEU on ro-en translation without any parallel data or back-translation.

* ACL 2020

View paper on

Share this with someone who'll enjoy it:

Title:Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation

Paper and Code