Alert button

"speech": models, code, and papers
Alert button

KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos

Mar 01, 2019
Egor Lakomkin, Sven Magg, Cornelius Weber, Stefan Wermter

Figure 1 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 2 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 3 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 4 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Viaarxiv icon

Eigenresiduals for improved Parametric Speech Synthesis

Jan 02, 2020
Thomas Drugman, Geoffrey Wilfart, Thierry Dutoit

Figure 1 for Eigenresiduals for improved Parametric Speech Synthesis
Figure 2 for Eigenresiduals for improved Parametric Speech Synthesis
Figure 3 for Eigenresiduals for improved Parametric Speech Synthesis
Figure 4 for Eigenresiduals for improved Parametric Speech Synthesis
Viaarxiv icon

Dim Wihl Gat Tun: The Case for Linguistic Expertise in NLP for Underdocumented Languages

Mar 17, 2022
Clarissa Forbes, Farhan Samir, Bruce Harold Oliver, Changbing Yang, Edith Coates, Garrett Nicolai, Miikka Silfverberg

Figure 1 for Dim Wihl Gat Tun: The Case for Linguistic Expertise in NLP for Underdocumented Languages
Figure 2 for Dim Wihl Gat Tun: The Case for Linguistic Expertise in NLP for Underdocumented Languages
Viaarxiv icon

FAIR4Cov: Fused Audio Instance and Representation for COVID-19 Detection

Apr 22, 2022
Tuan Truong, Matthias Lenga, Antoine Serrurier, Sadegh Mohammadi

Figure 1 for FAIR4Cov: Fused Audio Instance and Representation for COVID-19 Detection
Figure 2 for FAIR4Cov: Fused Audio Instance and Representation for COVID-19 Detection
Figure 3 for FAIR4Cov: Fused Audio Instance and Representation for COVID-19 Detection
Figure 4 for FAIR4Cov: Fused Audio Instance and Representation for COVID-19 Detection
Viaarxiv icon

DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network for Speech Enhancement

Dec 19, 2020
Huixiang Huang, Renjie Wu, Jingbiao Huang, Jucai Lin

Figure 1 for DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network for Speech Enhancement
Figure 2 for DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network for Speech Enhancement
Figure 3 for DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network for Speech Enhancement
Figure 4 for DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network for Speech Enhancement
Viaarxiv icon

WHALETRANS: E2E WHisper to nAturaL spEech conversion using modified TRANSformer network

Apr 20, 2020
Abhishek Niranjan, Mukesh Sharma, Sai Bharath Chandra Gutha, M Ali Basha Shaik

Figure 1 for WHALETRANS: E2E WHisper to nAturaL spEech conversion using modified TRANSformer network
Figure 2 for WHALETRANS: E2E WHisper to nAturaL spEech conversion using modified TRANSformer network
Figure 3 for WHALETRANS: E2E WHisper to nAturaL spEech conversion using modified TRANSformer network
Figure 4 for WHALETRANS: E2E WHisper to nAturaL spEech conversion using modified TRANSformer network
Viaarxiv icon

Text-Aware End-to-end Mispronunciation Detection and Diagnosis

Jun 15, 2022
Linkai Peng, Yingming Gao, Binghuai Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang

Figure 1 for Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Figure 2 for Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Figure 3 for Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Figure 4 for Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Viaarxiv icon

Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network

Apr 16, 2020
Tifani Warnita, Mariana Rodrigues Makiuchi, Nakamasa Inoue, Koichi Shinoda, Michitaka Yoshimura, Momoko Kitazawa, Kei Funaki, Yoko Eguchi, Taishiro Kishimoto

Figure 1 for Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network
Figure 2 for Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network
Figure 3 for Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network
Figure 4 for Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network
Viaarxiv icon

Comparing acoustic analyses of speech data collected remotely

Mar 01, 2021
Cong Zhang, Kathleen Jepson, Georg Lohfink, Amalia Arvaniti

Figure 1 for Comparing acoustic analyses of speech data collected remotely
Figure 2 for Comparing acoustic analyses of speech data collected remotely
Figure 3 for Comparing acoustic analyses of speech data collected remotely
Figure 4 for Comparing acoustic analyses of speech data collected remotely
Viaarxiv icon

KUCST@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text

Apr 09, 2022
Manex Agirrezabal, Janek Amann

Figure 1 for KUCST@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text
Figure 2 for KUCST@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text
Figure 3 for KUCST@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text
Figure 4 for KUCST@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text
Viaarxiv icon