"speech": models, code, and papers

VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature

Apr 02, 2022
Chenpeng Du, Yiwei Guo, Xie Chen, Kai Yu

Temporal envelope and fine structure cues for dysarthric speech detection using CNNs

Aug 25, 2021
Ina Kodrasi

Incorporating Multi-Target in Multi-Stage Speech Enhancement Model for Better Generalization

Jul 09, 2021
Lu Zhang, Mingjiang Wang, Andong Li, Zehua Zhang, Xuyi Zhuang

Efficient acoustic feature transformation in mismatched environments using a Guided-GAN

Oct 06, 2022
Walter Heymans, Marelie H. Davel, Charl van Heerden

Subband-based Generative Adversarial Network for Non-parallel Many-to-many Voice Conversion

Jul 13, 2022
Jian Ma, Zhedong Zheng, Hao Fei, Feng Zheng, Tat-seng Chua, Yi Yang

Self-attending RNN for Speech Enhancement to Improve Cross-corpus Generalization

May 26, 2021
Ashutosh Pandey, DeLiang Wang

Analysis of Disfluency in Children's Speech

Oct 08, 2020
Trang Tran, Morgan Tinkler, Gary Yeung, Abeer Alwan, Mari Ostendorf

Meta Auxiliary Learning for Low-resource Spoken Language Understanding

Jun 26, 2022
Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang

It's a long way! Layer-wise Relevance Propagation for Echo State Networks applied to Earth System Variability

Oct 18, 2022
Marco Landt-Hayen, Peer Kröger, Martin Claus, Willi Rath

Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss

Feb 05, 2022
Arka Mitra, Priyanshu Sankhala
