Alert button

"speech": models, code, and papers
Alert button

Fusion of Self-supervised Learned Models for MOS Prediction

Apr 11, 2022
Zhengdong Yang, Wangjin Zhou, Chenhui Chu, Sheng Li, Raj Dabre, Raphael Rubino, Yi Zhao

Figure 1 for Fusion of Self-supervised Learned Models for MOS Prediction
Figure 2 for Fusion of Self-supervised Learned Models for MOS Prediction
Figure 3 for Fusion of Self-supervised Learned Models for MOS Prediction
Figure 4 for Fusion of Self-supervised Learned Models for MOS Prediction
Viaarxiv icon

Audio-Based Deep Learning Frameworks for Detecting COVID-19

Mar 02, 2022
Dat Ngo, Lam Pham, Truong Hoang, Sefki Kolozali, Delaram Jarchi

Figure 1 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 2 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 3 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 4 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Viaarxiv icon

SVSNet: An End-to-end Speaker Voice Similarity Assessment Model

Jul 20, 2021
Cheng-Hung Hu, Yu-Huai Peng, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang

Figure 1 for SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Figure 2 for SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Figure 3 for SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Figure 4 for SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Viaarxiv icon

Speech recognition for medical conversations

Jun 20, 2018
Chung-Cheng Chiu, Anshuman Tripathi, Katherine Chou, Chris Co, Navdeep Jaitly, Diana Jaunzeikare, Anjuli Kannan, Patrick Nguyen, Hasim Sak, Ananth Sankar, Justin Tansuwan, Nathan Wan, Yonghui Wu, Xuedong Zhang

Figure 1 for Speech recognition for medical conversations
Figure 2 for Speech recognition for medical conversations
Figure 3 for Speech recognition for medical conversations
Viaarxiv icon

Speech based Depression Severity Level Classification Using a Multi-Stage Dilated CNN-LSTM Model

Apr 09, 2021
Nadee Seneviratne, Carol Espy-Wilson

Figure 1 for Speech based Depression Severity Level Classification Using a Multi-Stage Dilated CNN-LSTM Model
Figure 2 for Speech based Depression Severity Level Classification Using a Multi-Stage Dilated CNN-LSTM Model
Figure 3 for Speech based Depression Severity Level Classification Using a Multi-Stage Dilated CNN-LSTM Model
Figure 4 for Speech based Depression Severity Level Classification Using a Multi-Stage Dilated CNN-LSTM Model
Viaarxiv icon

Emotional Voice Conversion using multitask learning with Text-to-speech

Add code
Bookmark button
Alert button
Nov 11, 2019
Tae-Ho Kim, Sungjae Cho, Shinkook Choi, Sejik Park, Soo-Young Lee

Figure 1 for Emotional Voice Conversion using multitask learning with Text-to-speech
Figure 2 for Emotional Voice Conversion using multitask learning with Text-to-speech
Figure 3 for Emotional Voice Conversion using multitask learning with Text-to-speech
Figure 4 for Emotional Voice Conversion using multitask learning with Text-to-speech
Viaarxiv icon

A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions

Oct 23, 2021
Zhor Benhafid, Kawthar Yasmine Zergat, Abderrahmane Amrouche

Figure 1 for A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions
Figure 2 for A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions
Figure 3 for A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions
Figure 4 for A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions
Viaarxiv icon

Are you wearing a mask? Improving mask detection from speech using augmentation by cycle-consistent GANs

Add code
Bookmark button
Alert button
Jun 17, 2020
Nicolae-Cătălin Ristea, Radu Tudor Ionescu

Figure 1 for Are you wearing a mask? Improving mask detection from speech using augmentation by cycle-consistent GANs
Figure 2 for Are you wearing a mask? Improving mask detection from speech using augmentation by cycle-consistent GANs
Figure 3 for Are you wearing a mask? Improving mask detection from speech using augmentation by cycle-consistent GANs
Figure 4 for Are you wearing a mask? Improving mask detection from speech using augmentation by cycle-consistent GANs
Viaarxiv icon

Convolutional Neural Network-based Speech Enhancement for Cochlear Implant Recipients

Jul 03, 2019
Nursadul Mamun, Soheil Khorram, John H. L. Hansen

Figure 1 for Convolutional Neural Network-based Speech Enhancement for Cochlear Implant Recipients
Figure 2 for Convolutional Neural Network-based Speech Enhancement for Cochlear Implant Recipients
Figure 3 for Convolutional Neural Network-based Speech Enhancement for Cochlear Implant Recipients
Figure 4 for Convolutional Neural Network-based Speech Enhancement for Cochlear Implant Recipients
Viaarxiv icon

Spatial Concept-Based Navigation with Human Speech Instructions via Probabilistic Inference on Bayesian Generative Model

Add code
Bookmark button
Alert button
Feb 18, 2020
Akira Taniguchi, Yoshinobu Hagiwara, Tadahiro Taniguchi, Tetsunari Inamura

Figure 1 for Spatial Concept-Based Navigation with Human Speech Instructions via Probabilistic Inference on Bayesian Generative Model
Figure 2 for Spatial Concept-Based Navigation with Human Speech Instructions via Probabilistic Inference on Bayesian Generative Model
Figure 3 for Spatial Concept-Based Navigation with Human Speech Instructions via Probabilistic Inference on Bayesian Generative Model
Figure 4 for Spatial Concept-Based Navigation with Human Speech Instructions via Probabilistic Inference on Bayesian Generative Model
Viaarxiv icon