Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation

Sep 06, 2020

Akhil Mathur, Fahim Kawsar, Nadia Berthouze, Nicholas D. Lane

Figure 1 for Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation

Figure 2 for Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation

Figure 3 for Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation

Figure 4 for Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation

Share this with someone who'll enjoy it:

Abstract:This paper introduces a new dataset, Libri-Adapt, to support unsupervised domain adaptation research on speech recognition models. Built on top of the LibriSpeech corpus, Libri-Adapt contains English speech recorded on mobile and embedded-scale microphones, and spans 72 different domains that are representative of the challenging practical scenarios encountered by ASR models. More specifically, Libri-Adapt facilitates the study of domain shifts in ASR models caused by a) different acoustic environments, b) variations in speaker accents, c) heterogeneity in the hardware and platform software of the microphones, and d) a combination of the aforementioned three shifts. We also provide a number of baseline results quantifying the impact of these domain shifts on the Mozilla DeepSpeech2 ASR model.

* 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pp. 7439-7443 * 5 pages, Published at IEEE ICASSP 2020

View paper on

Share this with someone who'll enjoy it:

Title:Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation

Paper and Code