Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only

Aug 11, 2018

Yi-Chen Chen, Chia-Hao Shen, Sung-Feng Huang, Hung-yi Lee

Figure 1 for Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only

Figure 2 for Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only

Figure 3 for Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only

Figure 4 for Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only

Share this with someone who'll enjoy it:

Abstract:Automatic speech recognition (ASR) has been widely researched with supervised approaches, while many low-resourced languages lack audio-text aligned data, and supervised methods cannot be applied on them. In this work, we propose a framework to achieve unsupervised ASR on a read English speech dataset, where audio and text are unaligned. In the first stage, each word-level audio segment in the utterances is represented by a vector representation extracted by a sequence-of-sequence autoencoder, in which phonetic information and speaker information are disentangled. Secondly, semantic embeddings of audio segments are trained from the vector representations using a skip-gram model. Last but not the least, an unsupervised method is utilized to transform semantic embeddings of audio segments to text embedding space, and finally the transformed embeddings are mapped to words. With the above framework, we are towards unsupervised ASR trained by unaligned text and speech only.

* Code is released: https://github.com/grtzsohalf/Towards-Unsupervised-ASR

View paper on

Share this with someone who'll enjoy it:

Title:Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only

Paper and Code