Title: Speaker-independent classification of phonetic segments from raw ultrasound in child speech

Jul 01, 2019

Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals

Abstract: Ultrasound tongue imaging (UTI) provides a convenient way to visualize the vocal tract during speech production. UTI is increasingly being used for speech therapy, making it important to develop automatic methods to assist various time-consuming manual tasks currently performed by speech therapists. A key challenge is to generalize the automatic processing of ultrasound tongue images to previously unseen speakers. In this work, we investigate the classification of phonetic segments (tongue shapes) from raw ultrasound recordings under several training scenarios: speaker-dependent, multi-speaker, speaker-independent, and speaker-adapted. We observe that models underperform when applied to data from speakers not seen at training time. However, when provided with minimal additional speaker information, such as the mean ultrasound frame, the models generalize better to unseen speakers.

* 5 pages, 4 figures, published in ICASSP2019 (IEEE International Conference on Acoustics, Speech and Signal Processing, 2019)
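
The abstract mentions supplying a classifier with the mean ultrasound frame as minimal speaker information. One way this could be done is sketched below, assuming the frames of a single speaker are held in a NumPy array of shape `(n_frames, height, width)` and that the mean is attached as a second input channel. The function name `add_speaker_mean_channel`, the array layout, and the channel-stacking choice are illustrative assumptions, not details taken from the paper.

```python
import numpy as np


def add_speaker_mean_channel(frames: np.ndarray) -> np.ndarray:
    """Pair every frame with its speaker's mean frame (hypothetical helper).

    frames: array of shape (n_frames, height, width) from ONE speaker.
    Returns an array of shape (n_frames, 2, height, width), where channel 0
    is the original frame and channel 1 is the speaker mean frame.
    """
    if frames.ndim != 3:
        raise ValueError("expected frames with shape (n_frames, height, width)")

    # Average over the time axis to get one (height, width) mean frame.
    mean_frame = frames.mean(axis=0)

    # Repeat the mean frame (as a read-only view) so it lines up with each frame.
    mean_stack = np.broadcast_to(mean_frame, frames.shape)

    # Stack along a new channel axis; np.stack copies into a fresh array.
    return np.stack([frames, mean_stack], axis=1)


if __name__ == "__main__":
    # Synthetic stand-in data; real ultrasound dimensions will differ.
    rng = np.random.default_rng(0)
    speaker_frames = rng.random((10, 63, 138), dtype=np.float32)
    print(add_speaker_mean_channel(speaker_frames).shape)
```

For an unseen speaker, the mean would be computed from whatever recordings of that speaker are available, which requires no phonetic labels.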
