Geliang Chen

Phrase Based Language Model for Statistical Machine Translation: Empirical Study

Feb 18, 2015

Geliang Chen

Abstract: Reordering is a challenge for machine translation (MT) systems. In MT, the widely used approach is to apply a word-based language model (LM), which treats words as the constituent units of a sentence. In speech recognition (SR), some phrase-based LMs have been proposed. However, those LMs are not necessarily suitable or optimal for reordering. We propose two phrase-based LMs, which treat phrases as the constituent units of a sentence. Experiments show that our phrase-based LMs outperform the word-based LM with respect to perplexity and n-best list re-ranking.

* Supplementary material for http://arxiv.org/abs/1501.04324. This version is identical to the Bachelor thesis of Geliang Chen, archived on 20 June 2013 at Peking University. Thesis advisor: Professor Jia Xu

Phrase Based Language Model For Statistical Machine Translation

Jan 18, 2015

Jia Xu, Geliang Chen

Abstract: We consider phrase-based language models (LMs), which generalize the commonly used word-level models. A similar concept of phrase-based LMs appears in speech recognition, but it is rather specialized and thus less suitable for machine translation (MT). In contrast to the dependency LM, we first introduce exhaustive phrase-based LMs tailored for MT use. Preliminary experimental results show that our approach outperforms word-based LMs with respect to perplexity and translation quality.

* 5 pages. This version of the paper was submitted for review to EMNLP 2013. The title, the idea, and the content of this paper were presented by the first author at the machine translation group meeting at the MSRA-NLC lab (Microsoft Research Asia, Natural Language Computing) on July 16, 2013
