"speech": models, code, and papers

Accurate synthesis of Dysarthric Speech for ASR data augmentation

Aug 16, 2023
Mohammad Soleymanpour, Michael T. Johnson, Rahim Soleymanpour, Jeffrey Berry

LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech

Sep 11, 2023
Titouan Parcollet, Ha Nguyen, Solene Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Esteve, Mickael Rouvier, Jerome Goulian, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment

Nov 07, 2023
Jakir Hasan, Shrestha Datta, Ameya Debnath

Hateful Messages: A Conversational Data Set of Hate Speech produced by Adolescents on Discord

Sep 04, 2023
Jan Fillies, Silvio Peikert, Adrian Paschke

Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments

Sep 12, 2023
Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May

ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers

Aug 30, 2023
Yi Liu, Yuekang Li, Gelei Deng, Felix Juefei-Xu, Yao Du, Cen Zhang, Chengwei Liu, Yeting Li, Lei Ma, Yang Liu

M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models

Nov 19, 2023
Atin Sakkeer Hussain, Shansong Liu, Chenshuo Sun, Ying Shan

The DeepZen Speech Synthesis System for Blizzard Challenge 2023

Aug 30, 2023
Christophe Veaux, Ranniery Maia, Spyridoula Papendreou

CReHate: Cross-cultural Re-annotation of English Hate Speech Dataset

Aug 31, 2023
Nayeon Lee, Chani Jung, Junho Myung, Jiho Jin, Juho Kim, Alice Oh

Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations

Aug 24, 2023
Wenbin Wang, Yang Song, Sanjay Jha
