Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Phoneme-BERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript

Feb 01, 2021

Mukuntha Narayanan Sundararaman, Ayush Kumar, Jithendra Vepa

Figure 1 for Phoneme-BERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript

Figure 2 for Phoneme-BERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript

Figure 3 for Phoneme-BERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript

Figure 4 for Phoneme-BERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript

Share this with someone who'll enjoy it:

Abstract:Recent years have witnessed significant improvement in ASR systems to recognize spoken utterances. However, it is still a challenging task for noisy and out-of-domain data, where substitution and deletion errors are prevalent in the transcribed text. These errors significantly degrade the performance of downstream tasks. In this work, we propose a BERT-style language model, referred to as PhonemeBERT, that learns a joint language model with phoneme sequence and ASR transcript to learn phonetic-aware representations that are robust to ASR errors. We show that PhonemeBERT can be used on downstream tasks using phoneme sequences as additional features, and also in low-resource setup where we only have ASR-transcripts for the downstream tasks with no phoneme information available. We evaluate our approach extensively by generating noisy data for three benchmark datasets - Stanford Sentiment Treebank, TREC and ATIS for sentiment, question and intent classification tasks respectively. The results of the proposed approach beats the state-of-the-art baselines comprehensively on each dataset.

View paper on

Share this with someone who'll enjoy it:

Title:Phoneme-BERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript

Paper and Code