Title: Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models

Dec 28, 2019

Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim

Abstract: In this paper, we propose a refined multi-stage multi-task training strategy to improve the performance of online attention-based encoder-decoder (AED) models. We propose a three-stage training scheme based on three levels of architectural granularity, namely, the character encoder, the byte pair encoding (BPE) based encoder, and the attention decoder. We also use multi-task learning based on two levels of linguistic granularity, namely, character and BPE. We explore different pre-training strategies for the encoders, including transfer learning from a bidirectional encoder. Our encoder-decoder models with online attention show 35% and 10% relative improvement over their baselines for the smaller and bigger models, respectively. Our models achieve word error rates (WER) of 5.04% and 4.48% on the Librispeech test-clean data for the smaller and bigger models, respectively, after fusion with a long short-term memory (LSTM) based external language model (LM).
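For readers unfamiliar with the term, "relative improvement" in WER is conventionally the WER reduction divided by the baseline WER. The following is a minimal sketch of that arithmetic only; the function name and the example numbers are hypothetical and are not taken from the paper, whose abstract does not state the baseline WERs.

```python
def relative_wer_improvement(baseline_wer: float, new_wer: float) -> float:
    """Return the WER reduction as a fraction of the baseline WER."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


# Hypothetical values chosen for illustration, not results from the paper:
# a baseline WER of 10.0% reduced to 6.5% is a 35% relative improvement.
improvement = relative_wer_improvement(10.0, 6.5)
print(f"{improvement:.0%}")
```

Note that a relative figure says nothing about absolute WER on its own, which is why the abstract reports the final WERs separately.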

* Accepted and presented at the ASRU 2019 conference
