Alert button

"speech recognition": models, code, and papers
Alert button

Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech

Oct 01, 2023
Dareen Alharthi, Roshan Sharma, Hira Dhamyal, Soumi Maiti, Bhiksha Raj, Rita Singh

Figure 1 for Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech
Figure 2 for Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech
Viaarxiv icon

Some voices are too common: Building fair speech recognition systems using the Common Voice dataset

Jun 01, 2023
Lucas Maison, Yannick Estève

Figure 1 for Some voices are too common: Building fair speech recognition systems using the Common Voice dataset
Figure 2 for Some voices are too common: Building fair speech recognition systems using the Common Voice dataset
Figure 3 for Some voices are too common: Building fair speech recognition systems using the Common Voice dataset
Figure 4 for Some voices are too common: Building fair speech recognition systems using the Common Voice dataset
Viaarxiv icon

Human Transcription Quality Improvement

Add code
Bookmark button
Alert button
Sep 24, 2023
Jian Gao, Hanbo Sun, Cheng Cao, Zheng Du

Figure 1 for Human Transcription Quality Improvement
Figure 2 for Human Transcription Quality Improvement
Figure 3 for Human Transcription Quality Improvement
Figure 4 for Human Transcription Quality Improvement
Viaarxiv icon

Distilling HuBERT with LSTMs via Decoupled Knowledge Distillation

Add code
Bookmark button
Alert button
Sep 18, 2023
Danilo de Oliveira, Timo Gerkmann

Viaarxiv icon

Memory-augmented conformer for improved end-to-end long-form ASR

Add code
Bookmark button
Alert button
Sep 22, 2023
Carlos Carvalho, Alberto Abad

Figure 1 for Memory-augmented conformer for improved end-to-end long-form ASR
Figure 2 for Memory-augmented conformer for improved end-to-end long-form ASR
Figure 3 for Memory-augmented conformer for improved end-to-end long-form ASR
Viaarxiv icon

HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model

Oct 06, 2023
Takashi Maekaku, Jiatong Shi, Xuankai Chang, Yuya Fujita, Shinji Watanabe

Figure 1 for HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model
Figure 2 for HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model
Figure 3 for HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model
Figure 4 for HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model
Viaarxiv icon

calamanCy: A Tagalog Natural Language Processing Toolkit

Add code
Bookmark button
Alert button
Nov 13, 2023
Lester James V. Miranda

Figure 1 for calamanCy: A Tagalog Natural Language Processing Toolkit
Figure 2 for calamanCy: A Tagalog Natural Language Processing Toolkit
Figure 3 for calamanCy: A Tagalog Natural Language Processing Toolkit
Figure 4 for calamanCy: A Tagalog Natural Language Processing Toolkit
Viaarxiv icon

AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR

Sep 30, 2023
Tobi Olatunji, Tejumade Afonja, Aditya Yadavalli, Chris Chinenye Emezue, Sahib Singh, Bonaventure F. P. Dossou, Joanne Osuchukwu, Salomey Osei, Atnafu Lambebo Tonja, Naome Etori, Clinton Mbataku

Figure 1 for AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR
Figure 2 for AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR
Figure 3 for AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR
Figure 4 for AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR
Viaarxiv icon

An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing

Add code
Bookmark button
Alert button
Jun 24, 2023
Lester Phillip Violeta, Tomoki Toda

Figure 1 for An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing
Figure 2 for An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing
Figure 3 for An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing
Figure 4 for An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing
Viaarxiv icon

DiaCorrect: Error Correction Back-end For Speaker Diarization

Add code
Bookmark button
Alert button
Sep 15, 2023
Jiangyu Han, Federico Landini, Johan Rohdin, Mireia Diez, Lukas Burget, Yuhang Cao, Heng Lu, Jan Cernocky

Figure 1 for DiaCorrect: Error Correction Back-end For Speaker Diarization
Figure 2 for DiaCorrect: Error Correction Back-end For Speaker Diarization
Figure 3 for DiaCorrect: Error Correction Back-end For Speaker Diarization
Figure 4 for DiaCorrect: Error Correction Back-end For Speaker Diarization
Viaarxiv icon