"speech": models, code, and papers

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Apr 03, 2019
Kyubyong Park, Thomas Mulc

Universal adversarial examples in speech command classification

Nov 26, 2019
Jon Vadillo, Roberto Santana

Self-training and Pre-training are Complementary for Speech Recognition

Oct 22, 2020
Qiantong Xu, Alexei Baevski, Tatiana Likhomanenko, Paden Tomasello, Alexis Conneau, Ronan Collobert, Gabriel Synnaeve, Michael Auli

FDN: Finite Difference Network with Hierarchical Convolutional Features for Text-independent Speaker verification

Aug 18, 2021
Jin Li, Nan Yan, Lan Wang

Parameter Tuning of Time-Frequency Masking Algorithms for Reverberant Artifact Removal within the Cochlear Implant Stimulus

Aug 12, 2021
Lidea K. Shahidi, Leslie M. Collins, Boyla O. Mainsah

Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework

Nov 07, 2019
Mingbo Ma, Baigong Zheng, Kaibo Liu, Renjie Zheng, Hairong Liu, Kainan Peng, Kenneth Church, Liang Huang

Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition

Feb 16, 2021
Aswin Shanmugam Subramanian, Chao Weng, Shinji Watanabe, Meng Yu, Dong Yu

Data Augmentation for End-to-end Code-switching Speech Recognition

Nov 04, 2020
Chenpeng Du, Hao Li, Yizhou Lu, Lan Wang, Yanmin Qian

Noisy-to-Noisy Voice Conversion Framework with Denoising Model

Sep 22, 2021
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

Towards Learning Universal Audio Representations

Nov 23, 2021
Luyu Wang, Pauline Luc, Yan Wu, Adria Recasens, Lucas Smaira, Andrew Brock, Andrew Jaegle, Jean-Baptiste Alayrac, Sander Dieleman, Joao Carreira, Aaron van den Oord
