Alert button

"speech": models, code, and papers
Alert button

Leveraging neural representations for facilitating access to untranscribed speech from endangered languages

Add code
Bookmark button
Alert button
Mar 26, 2021
Nay San, Martijn Bartelds, Mitchell Browne, Lily Clifford, Fiona Gibson, John Mansfield, David Nash, Jane Simpson, Myfany Turpin, Maria Vollmer, Sasha Wilmoth, Dan Jurafsky

Figure 1 for Leveraging neural representations for facilitating access to untranscribed speech from endangered languages
Figure 2 for Leveraging neural representations for facilitating access to untranscribed speech from endangered languages
Figure 3 for Leveraging neural representations for facilitating access to untranscribed speech from endangered languages
Viaarxiv icon

Voice Conversion for Whispered Speech Synthesis

Jan 17, 2020
Marius Cotescu, Thomas Drugman, Goeric Huybrechts, Jaime Lorenzo-Trueba, Alexis Moinet

Figure 1 for Voice Conversion for Whispered Speech Synthesis
Figure 2 for Voice Conversion for Whispered Speech Synthesis
Figure 3 for Voice Conversion for Whispered Speech Synthesis
Figure 4 for Voice Conversion for Whispered Speech Synthesis
Viaarxiv icon

The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task

Add code
Bookmark button
Alert button
Jul 08, 2021
Chen Xu, Xiaoqian Liu, Xiaowen Liu, Laohu Wang, Canan Huang, Tong Xiao, Jingbo Zhu

Figure 1 for The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task
Figure 2 for The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task
Figure 3 for The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task
Figure 4 for The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task
Viaarxiv icon

LibriVoxDeEn: A Corpus for German-to-English Speech Translation and Speech Recognition

Add code
Bookmark button
Alert button
Oct 18, 2019
Benjamin Beilharz, Xin Sun, Sariya Karimova, Stefan Riezler

Figure 1 for LibriVoxDeEn: A Corpus for German-to-English Speech Translation and Speech Recognition
Figure 2 for LibriVoxDeEn: A Corpus for German-to-English Speech Translation and Speech Recognition
Figure 3 for LibriVoxDeEn: A Corpus for German-to-English Speech Translation and Speech Recognition
Figure 4 for LibriVoxDeEn: A Corpus for German-to-English Speech Translation and Speech Recognition
Viaarxiv icon

Streaming non-autoregressive model for any-to-many voice conversion

Add code
Bookmark button
Alert button
Jun 15, 2022
Ziyi Chen, Haoran Miao, Pengyuan Zhang

Figure 1 for Streaming non-autoregressive model for any-to-many voice conversion
Figure 2 for Streaming non-autoregressive model for any-to-many voice conversion
Figure 3 for Streaming non-autoregressive model for any-to-many voice conversion
Figure 4 for Streaming non-autoregressive model for any-to-many voice conversion
Viaarxiv icon

Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech

Nov 28, 2019
Vatsal Aggarwal, Marius Cotescu, Nishant Prateek, Jaime Lorenzo-Trueba, Roberto Barra-Chicote

Figure 1 for Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech
Figure 2 for Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech
Figure 3 for Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech
Figure 4 for Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech
Viaarxiv icon

Data augmentation using prosody and false starts to recognize non-native children's speech

Add code
Bookmark button
Alert button
Aug 29, 2020
Hemant Kathania, Mittul Singh, Tamás Grósz, Mikko Kurimo

Figure 1 for Data augmentation using prosody and false starts to recognize non-native children's speech
Figure 2 for Data augmentation using prosody and false starts to recognize non-native children's speech
Figure 3 for Data augmentation using prosody and false starts to recognize non-native children's speech
Figure 4 for Data augmentation using prosody and false starts to recognize non-native children's speech
Viaarxiv icon

Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation

Add code
Bookmark button
Alert button
Aug 11, 2021
Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram

Figure 1 for Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Figure 2 for Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Figure 3 for Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Figure 4 for Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Viaarxiv icon

HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection

Add code
Bookmark button
Alert button
Jan 22, 2021
Suman Dowlagar, Radhika Mamidi

Figure 1 for HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection
Figure 2 for HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection
Figure 3 for HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection
Figure 4 for HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection
Viaarxiv icon

POSSCORE: A Simple Yet Effective Evaluation of Conversational Search with Part of Speech Labelling

Add code
Bookmark button
Alert button
Sep 07, 2021
Zeyang Liu, Ke Zhou, Jiaxin Mao, Max L. Wilson

Figure 1 for POSSCORE: A Simple Yet Effective Evaluation of Conversational Search with Part of Speech Labelling
Figure 2 for POSSCORE: A Simple Yet Effective Evaluation of Conversational Search with Part of Speech Labelling
Figure 3 for POSSCORE: A Simple Yet Effective Evaluation of Conversational Search with Part of Speech Labelling
Figure 4 for POSSCORE: A Simple Yet Effective Evaluation of Conversational Search with Part of Speech Labelling
Viaarxiv icon