Title: PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR

May 20, 2020

Yiwen Shao, Yiming Wang, Daniel Povey, Sanjeev Khudanpur

Abstract: We present PyChain, a fully parallelized PyTorch implementation of end-to-end lattice-free maximum mutual information (LF-MMI) training for the so-called chain models in the Kaldi automatic speech recognition (ASR) toolkit. Unlike other PyTorch- and Kaldi-based ASR toolkits, PyChain is designed to be as flexible and lightweight as possible, so that it can be easily plugged into new ASR projects or into existing PyTorch-based ASR tools, as exemplified respectively by a new project, PyChain-example, and by Espresso, an existing end-to-end ASR toolkit. PyChain's efficiency and flexibility are demonstrated through novel features such as full GPU training on numerator/denominator graphs and support for unequal-length sequences. Experiments on the WSJ dataset show that, with simple neural networks and commonly used machine learning techniques, PyChain can achieve competitive results that are comparable to Kaldi and better than other end-to-end ASR systems.
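
To make the LF-MMI objective named in the abstract concrete, here is a toy sketch in plain Python. It is not PyChain's API or implementation: the function names, the graph encoding (a tuple of arcs, state count, start state, and final states), and the example scores are all assumptions made for illustration. For each utterance it computes the log-score of the numerator graph (paths consistent with the transcript) minus the log-score of a shared denominator graph (all competing paths), using a log-domain forward pass. Each utterance carries its own length, so padded frames are never read, which is one simple way to handle unequal-length sequences.

```python
import math

NEG_INF = float("-inf")


def log_add(a, b):
    """Numerically stable log(exp(a) + exp(b))."""
    if a == NEG_INF:
        return b
    if b == NEG_INF:
        return a
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))


def forward_log_prob(arcs, num_states, start, finals, log_probs, length):
    """Log-domain forward algorithm over a graph.

    arcs: list of (src, dst, pdf_id, log_weight) tuples (hypothetical encoding).
    log_probs[t][pdf_id]: frame-level log-score from the acoustic model.
    length: number of valid frames; frames at index >= length are padding.
    """
    alpha = [NEG_INF] * num_states
    alpha[start] = 0.0
    for t in range(length):  # only valid frames are visited
        nxt = [NEG_INF] * num_states
        for src, dst, pdf, weight in arcs:
            if alpha[src] != NEG_INF:
                score = alpha[src] + weight + log_probs[t][pdf]
                nxt[dst] = log_add(nxt[dst], score)
        alpha = nxt
    total = NEG_INF
    for state in finals:
        total = log_add(total, alpha[state])
    return total


def lfmmi_objective(batch, den_graph):
    """Sum over utterances of (numerator log-score - denominator log-score)."""
    total = 0.0
    for log_probs, length, num_graph in batch:
        num = forward_log_prob(*num_graph, log_probs, length)
        den = forward_log_prob(*den_graph, log_probs, length)
        total += num - den
    return total


# Toy denominator graph: one state with a self-loop per output unit (pdf 0, pdf 1).
den_graph = ([(0, 0, 0, 0.0), (0, 0, 1, 0.0)], 1, 0, [0])

# Utterance A: 2 frames, transcript forces pdf 0 then pdf 1.
num_a = ([(0, 1, 0, 0.0), (1, 2, 1, 0.0)], 3, 0, [2])
probs_a = [[math.log(0.7), math.log(0.3)], [math.log(0.4), math.log(0.6)]]

# Utterance B: 1 valid frame, padded to 2 frames; transcript forces pdf 1.
num_b = ([(0, 1, 1, 0.0)], 2, 0, [1])
probs_b = [[math.log(0.2), math.log(0.8)], [0.0, 0.0]]  # second frame is padding

batch = [(probs_a, 2, num_a), (probs_b, 1, num_b)]
objective = lfmmi_objective(batch, den_graph)
print(objective)
```

In this toy setup the denominator score of each utterance is log 1 = 0, because the frame scores are normalized and the denominator graph allows every path, so the objective reduces to log(0.42) + log(0.8). A real system would run the same recursion batched on a GPU and backpropagate through it, with training maximizing this quantity.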

* Submitted to Interspeech 2020
