Title: Lattice Transformer for Speech Translation

Jun 13, 2019

Pei Zhang, Boxing Chen, Niyu Ge, Kai Fan

Abstract: Recent advances in sequence modeling have highlighted the strengths of the transformer architecture, especially in achieving state-of-the-art machine translation results. However, depending on the upstream systems, e.g., speech recognition or word segmentation, the input to the translation system can vary greatly. The goal of this work is to extend the attention mechanism of the transformer to naturally consume lattices in addition to the traditional sequential input. We first propose a general lattice transformer for speech translation, where the input is the output of automatic speech recognition (ASR), which contains multiple paths and posterior scores. To leverage the extra information from the lattice structure, we develop a novel controllable lattice attention mechanism to obtain latent representations. On the LDC Spanish-English speech translation corpus, our experiments show that the lattice transformer generalizes significantly better and outperforms both a transformer baseline and a lattice LSTM. Additionally, we validate our approach on the WMT 2017 Chinese-English translation task with lattice inputs from different BPE segmentations. In this task, we also observe improvements over strong baselines.
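To make the idea of attention over a lattice more concrete, here is a minimal sketch in plain Python. It is not the paper's formulation: the abstract does not specify how the controllable lattice attention is computed. The sketch assumes each lattice node is one token, lets a token attend only to tokens that share at least one path with it, and adds the log of the ASR posterior to the attention logits. The function names `reachability` and `lattice_attention_weights`, the toy lattice, and the posterior values are all hypothetical.

```python
import math


def reachability(num_nodes, edges):
    """Return reach[i][j] == True when node j is reachable from node i (or i == j)."""
    reach = [[i == j for j in range(num_nodes)] for i in range(num_nodes)]
    for src, dst in edges:
        reach[src][dst] = True
    # Transitive closure (Warshall's algorithm) over the lattice DAG.
    for k in range(num_nodes):
        for i in range(num_nodes):
            if reach[i][k]:
                for j in range(num_nodes):
                    if reach[k][j]:
                        reach[i][j] = True
    return reach


def lattice_attention_weights(scores, edges, posteriors):
    """Softmax attention weights restricted to tokens on a shared lattice path.

    scores[i][j] is the raw query-key score, posteriors[j] is a strictly
    positive posterior for node j. Both conventions are assumptions made
    for this sketch.
    """
    n = len(scores)
    reach = reachability(n, edges)
    weights = []
    for i in range(n):
        logits = []
        for j in range(n):
            on_shared_path = reach[i][j] or reach[j][i]
            if on_shared_path:
                # Bias the logit by the log-posterior of the attended token.
                logits.append(scores[i][j] + math.log(posteriors[j]))
            else:
                # Competing hypotheses never attend to each other.
                logits.append(-math.inf)
        peak = max(logits)  # finite, because a node always shares a path with itself
        exps = [math.exp(x - peak) for x in logits]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


# Toy lattice: 0 -> 1 -> 3 and 0 -> 2 -> 3, so nodes 1 and 2 are alternatives.
edges = [(0, 1), (0, 2), (1, 3), (2, 3)]
posteriors = [1.0, 0.7, 0.3, 1.0]
scores = [[0.0] * 4 for _ in range(4)]
weights = lattice_attention_weights(scores, edges, posteriors)
```

In this toy lattice, nodes 1 and 2 are competing hypotheses for the same stretch of audio, so each gives the other zero attention weight, while nodes 0 and 3 attend to all four nodes with the less likely hypothesis down-weighted by its posterior.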

* accepted to ACL 2019
