Alert button

"speech": models, code, and papers
Alert button

Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement

Add code
Bookmark button
Alert button
May 26, 2020
Dongyang Dai, Li Chen, Yuping Wang, Mu Wang, Rui Xia, Xuchen Song, Zhiyong Wu, Yuxuan Wang

Figure 1 for Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Figure 2 for Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Figure 3 for Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Figure 4 for Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Viaarxiv icon

Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation

Add code
Bookmark button
Alert button
Apr 09, 2021
Yueyue Na, Ziteng Wang, Zhang Liu, Biao Tian, Qiang Fu

Figure 1 for Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Figure 2 for Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Figure 3 for Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Figure 4 for Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Viaarxiv icon

Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language

Add code
Bookmark button
Alert button
Feb 19, 2020
Kohei Matsuura, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara

Figure 1 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 2 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 3 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 4 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Viaarxiv icon

Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech

May 09, 2019
Tobias Menne, Ilya Sklyar, Ralf Schlüter, Hermann Ney

Figure 1 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 2 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 3 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 4 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Viaarxiv icon

AutoDiCE: Fully Automated Distributed CNN Inference at the Edge

Jul 20, 2022
Xiaotian Guo, Andy D. Pimentel, Todor Stefanov

Figure 1 for AutoDiCE: Fully Automated Distributed CNN Inference at the Edge
Figure 2 for AutoDiCE: Fully Automated Distributed CNN Inference at the Edge
Figure 3 for AutoDiCE: Fully Automated Distributed CNN Inference at the Edge
Figure 4 for AutoDiCE: Fully Automated Distributed CNN Inference at the Edge
Viaarxiv icon

Multitask Training with Text Data for End-to-End Speech Recognition

Oct 27, 2020
Peidong Wang, Tara N. Sainath, Ron J. Weiss

Figure 1 for Multitask Training with Text Data for End-to-End Speech Recognition
Figure 2 for Multitask Training with Text Data for End-to-End Speech Recognition
Figure 3 for Multitask Training with Text Data for End-to-End Speech Recognition
Figure 4 for Multitask Training with Text Data for End-to-End Speech Recognition
Viaarxiv icon

Comparing acoustic analyses of speech data collected remotely

Add code
Bookmark button
Alert button
Mar 01, 2021
Cong Zhang, Kathleen Jepson, Georg Lohfink, Amalia Arvaniti

Figure 1 for Comparing acoustic analyses of speech data collected remotely
Figure 2 for Comparing acoustic analyses of speech data collected remotely
Figure 3 for Comparing acoustic analyses of speech data collected remotely
Figure 4 for Comparing acoustic analyses of speech data collected remotely
Viaarxiv icon

Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech

Add code
Bookmark button
Alert button
May 14, 2022
Joonas Kalda, Tanel Alumäe

Figure 1 for Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech
Figure 2 for Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech
Figure 3 for Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech
Figure 4 for Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech
Viaarxiv icon

Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency

Add code
Bookmark button
Alert button
Jul 02, 2022
Jin Liu, Chongfeng Fan, Fengyu Zhou, Huijuan Xu

Figure 1 for Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency
Figure 2 for Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency
Figure 3 for Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency
Figure 4 for Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency
Viaarxiv icon

SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation

Add code
Bookmark button
Alert button
Feb 27, 2020
Arya D. McCarthy, Liezl Puzon, Juan Pino

Figure 1 for SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation
Figure 2 for SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation
Figure 3 for SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation
Figure 4 for SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation
Viaarxiv icon