Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Contrastive Semi-supervised Learning for ASR

Mar 09, 2021

Alex Xiao, Christian Fuegen, Abdelrahman Mohamed

Figure 1 for Contrastive Semi-supervised Learning for ASR

Figure 2 for Contrastive Semi-supervised Learning for ASR

Figure 3 for Contrastive Semi-supervised Learning for ASR

Figure 4 for Contrastive Semi-supervised Learning for ASR

Share this with someone who'll enjoy it:

Abstract:Pseudo-labeling is the most adopted method for pre-training automatic speech recognition (ASR) models. However, its performance suffers from the supervised teacher model's degrading quality in low-resource setups and under domain transfer. Inspired by the successes of contrastive representation learning for computer vision and speech applications, and more recently for supervised learning of visual objects, we propose Contrastive Semi-supervised Learning (CSL). CSL eschews directly predicting teacher-generated pseudo-labels in favor of utilizing them to select positive and negative examples. In the challenging task of transcribing public social media videos, using CSL reduces the WER by 8% compared to the standard Cross-Entropy pseudo-labeling (CE-PL) when 10hr of supervised data is used to annotate 75,000hr of videos. The WER reduction jumps to 19% under the ultra low-resource condition of using 1hr labels for teacher supervision. CSL generalizes much better in out-of-domain conditions, showing up to 17% WER reduction compared to the best CE-PL pre-trained model.

View paper on

Share this with someone who'll enjoy it:

Title:Contrastive Semi-supervised Learning for ASR

Paper and Code