Alert button

"speech": models, code, and papers
Alert button

Review of end-to-end speech synthesis technology based on deep learning

Add code
Bookmark button
Alert button
Apr 20, 2021
Zhaoxi Mu, Xinyu Yang, Yizhuo Dong

Figure 1 for Review of end-to-end speech synthesis technology based on deep learning
Figure 2 for Review of end-to-end speech synthesis technology based on deep learning
Figure 3 for Review of end-to-end speech synthesis technology based on deep learning
Figure 4 for Review of end-to-end speech synthesis technology based on deep learning
Viaarxiv icon

Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism

Aug 31, 2021
Yoshihiko Nankaku, Kenta Sumiya, Takenori Yoshimura, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Keiichi Tokuda

Figure 1 for Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism
Figure 2 for Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism
Figure 3 for Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism
Viaarxiv icon

Can We Trust Deep Speech Prior?

Add code
Bookmark button
Alert button
Nov 04, 2020
Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han

Figure 1 for Can We Trust Deep Speech Prior?
Figure 2 for Can We Trust Deep Speech Prior?
Figure 3 for Can We Trust Deep Speech Prior?
Figure 4 for Can We Trust Deep Speech Prior?
Viaarxiv icon

Mega: Moving Average Equipped Gated Attention

Add code
Bookmark button
Alert button
Sep 26, 2022
Xuezhe Ma, Chunting Zhou, Xiang Kong, Junxian He, Liangke Gui, Graham Neubig, Jonathan May, Luke Zettlemoyer

Figure 1 for Mega: Moving Average Equipped Gated Attention
Figure 2 for Mega: Moving Average Equipped Gated Attention
Figure 3 for Mega: Moving Average Equipped Gated Attention
Figure 4 for Mega: Moving Average Equipped Gated Attention
Viaarxiv icon

Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights

Add code
Bookmark button
Alert button
Jun 02, 2021
Devaraja Adiga, Rishabh Kumar, Amrith Krishna, Preethi Jyothi, Ganesh Ramakrishnan, Pawan Goyal

Figure 1 for Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
Figure 2 for Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
Figure 3 for Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
Figure 4 for Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
Viaarxiv icon

Language-Independent Approach for Automatic Computation of Vowel Articulation Features in Dysarthric Speech Assessment

Add code
Bookmark button
Alert button
Aug 17, 2021
Yuanyuan Liu, Nelly Penttilä, Tiina Ihalainen, Juulia Lintula, Rachel Convey, Okko Räsänen

Figure 1 for Language-Independent Approach for Automatic Computation of Vowel Articulation Features in Dysarthric Speech Assessment
Figure 2 for Language-Independent Approach for Automatic Computation of Vowel Articulation Features in Dysarthric Speech Assessment
Figure 3 for Language-Independent Approach for Automatic Computation of Vowel Articulation Features in Dysarthric Speech Assessment
Figure 4 for Language-Independent Approach for Automatic Computation of Vowel Articulation Features in Dysarthric Speech Assessment
Viaarxiv icon

3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment

Aug 19, 2022
Fu-An Chao, Tien-Hong Lo, Tzu-I Wu, Yao-Ting Sung, Berlin Chen

Figure 1 for 3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment
Figure 2 for 3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment
Figure 3 for 3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment
Figure 4 for 3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment
Viaarxiv icon

Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser

Add code
Bookmark button
Alert button
Apr 08, 2022
Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesus Villalba, Sanjeev Khudanpur, Najim Dehak

Figure 1 for Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser
Figure 2 for Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser
Figure 3 for Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser
Figure 4 for Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser
Viaarxiv icon

Digital Voicing of Silent Speech

Add code
Bookmark button
Alert button
Oct 06, 2020
David Gaddy, Dan Klein

Figure 1 for Digital Voicing of Silent Speech
Figure 2 for Digital Voicing of Silent Speech
Figure 3 for Digital Voicing of Silent Speech
Figure 4 for Digital Voicing of Silent Speech
Viaarxiv icon