Alert button

End-to-End Spoken Language Translation

Apr 23, 2019
Michelle Guo, Albert Haque, Prateek Verma

Figure 1 for End-to-End Spoken Language Translation
Figure 2 for End-to-End Spoken Language Translation
Figure 3 for End-to-End Spoken Language Translation
Figure 4 for End-to-End Spoken Language Translation

Share this with someone who'll enjoy it:

In this paper, we address the task of spoken language understanding. We present a method for translating spoken sentences from one language into spoken sentences in another language. Given spectrogram-spectrogram pairs, our model can be trained completely from scratch to translate unseen sentences. Our method consists of a pyramidal-bidirectional recurrent network combined with a convolutional network to output sentence-level spectrograms in the target language. Empirically, our model achieves competitive performance with state-of-the-art methods on multiple languages and can generalize to unseen speakers.

* Technical Report. Stanford University, 2017. arXiv admin note: text overlap with arXiv:1804.00047  
View paper onarxiv icon

Share this with someone who'll enjoy it: