"speech": models, code, and papers

Expressive, Variable, and Controllable Duration Modelling in TTS

Jun 28, 2022
Ammar Abbas, Thomas Merritt, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Simon Slangen, Elia Gatti, Thomas Drugman

Speech enhancement with weakly labelled data from AudioSet

Feb 19, 2021
Qiuqiang Kong, Haohe Liu, Xingjian Du, Li Chen, Rui Xia, Yuxuan Wang

More Speaking or More Speakers?

Nov 02, 2022
Dan Berrebbi, Ronan Collobert, Navdeep Jaitly, Tatiana Likhomanenko

Yunshan Cup 2020: Overview of the Part-of-Speech Tagging Task for Low-resourced Languages

Apr 06, 2022
Yingwen Fu, Jinyi Chen, Nankai Lin, Xixuan Huang, Xinying Qiu, Shengyi Jiang

Overview of the HASOC Subtrack at FIRE 2022: Offensive Language Identification in Marathi

Nov 18, 2022
Tharindu Ranasinghe, Kai North, Damith Premasiri, Marcos Zampieri

Neural Predictor for Black-Box Adversarial Attacks on Speech Recognition

Mar 18, 2022
Marie Biolková, Bac Nguyen

Protecting gender and identity with disentangled speech representations

Apr 22, 2021
Dimitrios Stoidis, Andrea Cavallaro

Parkinsonian Chinese Speech Analysis towards Automatic Classification of Parkinson's Disease

May 31, 2021
Hao Fang, Chen Gong, Chen Zhang, Yanan Sui, Luming Li

Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition

Sep 19, 2021
Guolin Zheng, Yubei Xiao, Ke Gong, Pan Zhou, Xiaodan Liang, Liang Lin

Jointly Predicting Emotion, Age, and Country Using Pre-Trained Acoustic Embedding

Jul 21, 2022
Bagus Tris Atmaja, Zanjabila, Akira Sasou
