Alert button

"speech": models, code, and papers
Alert button

AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation

Add code
Bookmark button
Alert button
Jun 01, 2022
Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su

Figure 1 for AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation
Figure 2 for AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation
Figure 3 for AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation
Figure 4 for AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation
Viaarxiv icon

The Development of a Labelled te reo Māori-English Bilingual Database for Language Technology

Add code
Bookmark button
Alert button
Aug 21, 2022
Jesin James, Isabella Shields, Vithya Yogarajan, Peter J. Keegan, Catherine Watson, Peter-Lucas Jones, Keoni Mahelona

Figure 1 for The Development of a Labelled te reo Māori-English Bilingual Database for Language Technology
Figure 2 for The Development of a Labelled te reo Māori-English Bilingual Database for Language Technology
Figure 3 for The Development of a Labelled te reo Māori-English Bilingual Database for Language Technology
Figure 4 for The Development of a Labelled te reo Māori-English Bilingual Database for Language Technology
Viaarxiv icon

Gender Representation in Open Source Speech Resources

Add code
Bookmark button
Alert button
Mar 18, 2020
Mahault Garnerin, Solange Rossato, Laurent Besacier

Figure 1 for Gender Representation in Open Source Speech Resources
Figure 2 for Gender Representation in Open Source Speech Resources
Figure 3 for Gender Representation in Open Source Speech Resources
Figure 4 for Gender Representation in Open Source Speech Resources
Viaarxiv icon

Arabic Code-Switching Speech Recognition using Monolingual Data

Jul 04, 2021
Ahmed Ali, Shammur Chowdhury, Amir Hussein, Yasser Hifny

Figure 1 for Arabic Code-Switching Speech Recognition using Monolingual Data
Figure 2 for Arabic Code-Switching Speech Recognition using Monolingual Data
Figure 3 for Arabic Code-Switching Speech Recognition using Monolingual Data
Figure 4 for Arabic Code-Switching Speech Recognition using Monolingual Data
Viaarxiv icon

Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition

Add code
Bookmark button
Alert button
Jun 07, 2022
Guangke Chen, Zhe Zhao, Fu Song, Sen Chen, Lingling Fan, Feng Wang, Jiashui Wang

Figure 1 for Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition
Figure 2 for Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition
Figure 3 for Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition
Figure 4 for Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition
Viaarxiv icon

Controllable neural text-to-speech synthesis using intuitive prosodic features

Add code
Bookmark button
Alert button
Sep 14, 2020
Tuomo Raitio, Ramya Rasipuram, Dan Castellani

Figure 1 for Controllable neural text-to-speech synthesis using intuitive prosodic features
Figure 2 for Controllable neural text-to-speech synthesis using intuitive prosodic features
Figure 3 for Controllable neural text-to-speech synthesis using intuitive prosodic features
Figure 4 for Controllable neural text-to-speech synthesis using intuitive prosodic features
Viaarxiv icon

DenseShift: Towards Accurate and Transferable Low-Bit Shift Network

Aug 20, 2022
Xinlin Li, Bang Liu, Rui Heng Yang, Vanessa Courville, Chao Xing, Vahid Partovi Nia

Figure 1 for DenseShift: Towards Accurate and Transferable Low-Bit Shift Network
Figure 2 for DenseShift: Towards Accurate and Transferable Low-Bit Shift Network
Figure 3 for DenseShift: Towards Accurate and Transferable Low-Bit Shift Network
Figure 4 for DenseShift: Towards Accurate and Transferable Low-Bit Shift Network
Viaarxiv icon

Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model

Add code
Bookmark button
Alert button
Apr 07, 2022
Nick J. C. Wang, Lu Wang, Yandan Sun, Haimei Kang, Dejun Zhang

Figure 1 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 2 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 3 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 4 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Viaarxiv icon

Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition

Sep 06, 2021
Arash Dehghani, Seyyed Ali Seyyedsalehi

Figure 1 for Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
Figure 2 for Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
Figure 3 for Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
Figure 4 for Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
Viaarxiv icon

Bilingual by default: Voice Assistants and the role of code-switching in creating a bilingual user experience

Jun 20, 2022
Helin Cihan, Yunhan Wu, Paola Peña, Justin Edwards, Benjamin Cowan

Viaarxiv icon