Neural machine translation (NMT) needs large parallel corpora for state-of-the-art translation quality. Low-resource NMT is typically addressed by transfer learning, which leverages large monolingual or parallel corpora for pre-training. Monolingual pre-training approaches such as MASS (MAsked Sequence to Sequence) are extremely effective in boosting NMT quality for languages with small parallel corpora. However, they do not account for linguistic information obtained from syntactic analyzers, which is known to be invaluable for several Natural Language Processing (NLP) tasks. To this end, we propose JASS, Japanese-specific Sequence to Sequence, as a novel pre-training alternative to MASS for NMT involving Japanese as the source or target language. JASS is joint BMASS (Bunsetsu MASS) and BRSS (Bunsetsu Reordering Sequence to Sequence) pre-training, which focuses on Japanese linguistic units called bunsetsus. In our experiments on ASPEC Japanese--English and News Commentary Japanese--Russian translation, we show that JASS can give results that are competitive with, if not better than, those given by MASS. Furthermore, we show for the first time that joint MASS and JASS pre-training gives results that significantly surpass those of the individual methods, indicating their complementary nature. We will release our code, pre-trained models, and bunsetsu-annotated data as resources for researchers to use in their own NLP tasks.
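To make the bunsetsu-level pre-training objective concrete, the following is a minimal, hypothetical sketch of a BMASS-style training pair: instead of masking an arbitrary token span as in MASS, a contiguous block of whole bunsetsu units is masked in the encoder input and reconstructed on the decoder side. It assumes the sentence has already been segmented into bunsetsus by a syntactic analyzer; the mask symbol, mask ratio, and function name are illustrative, not the released implementation.

```python
import random

MASK = "[MASK]"  # hypothetical mask symbol; the actual token depends on the vocabulary


def bmass_example(bunsetsus, mask_ratio=0.5, seed=None):
    """Build one BMASS-style (encoder input, decoder target) pair.

    `bunsetsus` is a sentence already segmented into bunsetsu units,
    each unit being a list of subword tokens. A contiguous block of
    whole bunsetsus is masked in the encoder input, and those masked
    tokens form the decoder target.
    """
    rng = random.Random(seed)
    n = len(bunsetsus)
    span = max(1, int(round(n * mask_ratio)))  # number of bunsetsus to mask
    start = rng.randint(0, n - span)           # start of the contiguous block

    encoder_input, decoder_target = [], []
    for i, unit in enumerate(bunsetsus):
        if start <= i < start + span:
            encoder_input.extend([MASK] * len(unit))  # mask the whole unit
            decoder_target.extend(unit)               # model must reconstruct it
        else:
            encoder_input.extend(unit)
    return encoder_input, decoder_target


# Toy usage with a pre-segmented sentence of three bunsetsus:
sent = [["私", "は"], ["本", "を"], ["読ん", "だ"]]
enc, dec = bmass_example(sent, mask_ratio=0.5, seed=0)
print(enc)  # e.g. ['私', 'は', '[MASK]', '[MASK]', '読ん', 'だ']
print(dec)  # e.g. ['本', 'を']
```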
In this paper, we propose a novel and elegant solution to "Multi-Source Neural Machine Translation" (MSNMT) which relies only on preprocessing an N-way multilingual corpus, without modifying the Neural Machine Translation (NMT) architecture or training procedure. We simply concatenate the source sentences to form a single long multi-source input sentence while keeping the target-side sentence as it is, and train an NMT system on this preprocessed corpus. We evaluate our method in resource-poor as well as resource-rich settings and show its effectiveness (up to 4 BLEU points using 2 source languages and up to 6 BLEU points using 5 source languages) by comparing against existing methods for MSNMT. We also provide some insights into how the NMT system leverages multilingual information in such a scenario by visualizing attention.
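Since the method is purely a preprocessing step, it can be sketched as a small script that joins line-aligned source sentences from the N-way corpus into one multi-source input line and copies the target side unchanged. The sketch below assumes plain whitespace as the join separator and invented file names; any language tags or special delimiters would be additional assumptions beyond the description above.

```python
def make_multisource_corpus(source_files, target_file, out_src, out_tgt, sep=" "):
    """Concatenate line-aligned source sentences from an N-way corpus.

    `source_files` are the N source-language files, aligned line by line
    with `target_file`. Each output source line is the source sentences
    joined in a fixed language order; the target side is copied as is.
    """
    sources = [open(p, encoding="utf-8") for p in source_files]
    with open(target_file, encoding="utf-8") as tgt, \
         open(out_src, "w", encoding="utf-8") as fs, \
         open(out_tgt, "w", encoding="utf-8") as ft:
        for lines in zip(*sources, tgt):
            *src_lines, tgt_line = [line.strip() for line in lines]
            fs.write(sep.join(src_lines) + "\n")  # single long multi-source input
            ft.write(tgt_line + "\n")             # target sentence unchanged
    for f in sources:
        f.close()


# Hypothetical usage: two source languages (French, German), English target.
# make_multisource_corpus(["train.fr", "train.de"], "train.en",
#                         "train.multisrc", "train.tgt")
```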