Alert button

"speech": models, code, and papers
Alert button

SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning

Oct 18, 2022
Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

Figure 1 for SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Figure 2 for SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Figure 3 for SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Figure 4 for SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Viaarxiv icon

Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages

Add code
Bookmark button
Alert button
Jan 27, 2022
Jivnesh Sandhan, Ayush Daksh, Om Adideva Paranjay, Laxmidhar Behera, Pawan Goyal

Figure 1 for Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages
Figure 2 for Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages
Figure 3 for Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages
Figure 4 for Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages
Viaarxiv icon

Revisiting Over-Smoothness in Text to Speech

Add code
Bookmark button
Alert button
Feb 26, 2022
Yi Ren, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu

Figure 1 for Revisiting Over-Smoothness in Text to Speech
Figure 2 for Revisiting Over-Smoothness in Text to Speech
Figure 3 for Revisiting Over-Smoothness in Text to Speech
Figure 4 for Revisiting Over-Smoothness in Text to Speech
Viaarxiv icon

Subject Enveloped Deep Sample Fuzzy Ensemble Learning Algorithm of Parkinson's Speech Data

Nov 17, 2021
Yiwen Wang, Fan Li, Xiaoheng Zhang, Pin Wang, Yongming Li

Figure 1 for Subject Enveloped Deep Sample Fuzzy Ensemble Learning Algorithm of Parkinson's Speech Data
Figure 2 for Subject Enveloped Deep Sample Fuzzy Ensemble Learning Algorithm of Parkinson's Speech Data
Figure 3 for Subject Enveloped Deep Sample Fuzzy Ensemble Learning Algorithm of Parkinson's Speech Data
Figure 4 for Subject Enveloped Deep Sample Fuzzy Ensemble Learning Algorithm of Parkinson's Speech Data
Viaarxiv icon

Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass

Feb 08, 2022
Olabanji Shonibare, Xiaosu Tong, Venkatesh Ravichandran

Viaarxiv icon

A Feature Extraction based Model for Hate Speech Identification

Add code
Bookmark button
Alert button
Jan 11, 2022
Salar Mohtaj, Vera Schmitt, Sebastian Möller

Figure 1 for A Feature Extraction based Model for Hate Speech Identification
Figure 2 for A Feature Extraction based Model for Hate Speech Identification
Figure 3 for A Feature Extraction based Model for Hate Speech Identification
Figure 4 for A Feature Extraction based Model for Hate Speech Identification
Viaarxiv icon

SpeechBrain: A General-Purpose Speech Toolkit

Add code
Bookmark button
Alert button
Jun 08, 2021
Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-Chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato De Mori, Yoshua Bengio

Figure 1 for SpeechBrain: A General-Purpose Speech Toolkit
Figure 2 for SpeechBrain: A General-Purpose Speech Toolkit
Figure 3 for SpeechBrain: A General-Purpose Speech Toolkit
Figure 4 for SpeechBrain: A General-Purpose Speech Toolkit
Viaarxiv icon

Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech

Jan 19, 2022
Kusha Sridhar, Carlos Busso

Figure 1 for Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech
Figure 2 for Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech
Figure 3 for Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech
Figure 4 for Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech
Viaarxiv icon

Non-Parallel Voice Conversion for ASR Augmentation

Sep 15, 2022
Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Yinghui Huang, Jesse Emond, Pedro Moreno Mengibar

Figure 1 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 2 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 3 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 4 for Non-Parallel Voice Conversion for ASR Augmentation
Viaarxiv icon

A Survey on Neural Speech Synthesis

Add code
Bookmark button
Alert button
Jul 23, 2021
Xu Tan, Tao Qin, Frank Soong, Tie-Yan Liu

Figure 1 for A Survey on Neural Speech Synthesis
Figure 2 for A Survey on Neural Speech Synthesis
Figure 3 for A Survey on Neural Speech Synthesis
Figure 4 for A Survey on Neural Speech Synthesis
Viaarxiv icon