Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shanq-Jang Ruan

Deep Learning-based automated classification of Chinese Speech Sound Disorders

May 24, 2022

Yao-Ming Kuo, Shanq-Jang Ruan, Yu-Chin Chen, Ya-Wen Tu

Figure 1 for Deep Learning-based automated classification of Chinese Speech Sound Disorders

Figure 2 for Deep Learning-based automated classification of Chinese Speech Sound Disorders

Figure 3 for Deep Learning-based automated classification of Chinese Speech Sound Disorders

Figure 4 for Deep Learning-based automated classification of Chinese Speech Sound Disorders

Abstract:This article describes a system for analyzing acoustic data in order to assist in the diagnosis and classification of children's speech disorders using a computer. The analysis concentrated on identifying and categorizing four distinct types of Chinese misconstructions. The study collected and generated a speech corpus containing 2540 Stopping, Velar, Consonant-vowel, and Affricate samples from 90 children aged 3-6 years with normal or pathological articulatory features. Each recording was accompanied by a detailed annotation from the field of speech therapy. Classification of the speech samples was accomplished using three well-established neural network models for image classification. The feature maps are created using three sets of MFCC parameters extracted from speech sounds and aggregated into a three-dimensional data structure as model input. We employ six techniques for data augmentation in order to augment the available dataset while avoiding over-simulation. The experiments examine the usability of four different categories of Chinese phrases and characters. Experiments with different data subsets demonstrate the system's ability to accurately detect the analyzed pronunciation disorders.

* 12 pages, 9 figures, journal

Via

Access Paper or Ask Questions