Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Lattice-based lightly-supervised acoustic model training

May 30, 2019

Joachim Fainberg, Ondřej Klejch, Steve Renals, Peter Bell

Figure 1 for Lattice-based lightly-supervised acoustic model training

Figure 2 for Lattice-based lightly-supervised acoustic model training

Figure 3 for Lattice-based lightly-supervised acoustic model training

Figure 4 for Lattice-based lightly-supervised acoustic model training

Share this with someone who'll enjoy it:

Abstract:In the broadcast domain there is an abundance of related text data and partial transcriptions, such as closed captions and subtitles. This text data can be used for lightly supervised training, in which text matching the audio is selected using an existing speech recognition model. Current approaches to light supervision typically filter the data based on matching error rates between the transcriptions and biased decoding hypotheses. In contrast, semi-supervised training does not require matching text data, instead generating a hypothesis using a background language model. State-of-the-art semi-supervised training uses lattice-based supervision with the lattice-free MMI (LF-MMI) objective function. We propose a technique to combine inaccurate transcriptions with the lattices generated for semi-supervised training, thus preserving uncertainty in the lattice where appropriate. We demonstrate that this combined approach reduces the expected error rates over the lattices, and reduces the word error rate (WER) on a broadcast task.

* Submitted to INTERSPEECH 2019

View paper on

Share this with someone who'll enjoy it:

Title:Lattice-based lightly-supervised acoustic model training

Paper and Code