"speech recognition": models, code, and papers

Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors

Oct 25, 2023
Marek Kubis, Paweł Skórzewski, Marcin Sowański, Tomasz Ziętkiewicz

Accented Speech Recognition With Accent-specific Codebooks

Oct 27, 2023
Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni

Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder

Oct 06, 2023
Zih-Jyun Lin, Yi-Ju Chen, Po-Chih Kuo, Likai Huang, Chaur-Jong Hu, Cheng-Yu Chen

Nonlinear functional regression by functional deep neural network with kernel embedding

Jan 05, 2024
Zhongjie Shi, Jun Fan, Linhao Song, Ding-Xuan Zhou, Johan A. K. Suykens

GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition

Nov 08, 2023
Daniel Galvez, Tim Kaldewey

TemporalAugmenter: An Ensemble Recurrent Based Deep Learning Approach for Signal Classification

Jan 13, 2024
Nelly Elsayed, Constantinos L. Zekios, Navid Asadizanjani, Zag ElSayed

SpokesBiz -- an Open Corpus of Conversational Polish

Dec 19, 2023
Piotr Pęzik, Sylwia Karasińska, Anna Cichosz, Łukasz Jałowiecki, Konrad Kaczyński, Małgorzata Krawentek, Karolina Walkusz, Paweł Wilk, Mariusz Kleć, Krzysztof Szklanny, Szymon Marszałkowski

Leveraged Mel spectrograms using Harmonic and Percussive Components in Speech Emotion Recognition

Dec 18, 2023
David Hason Rudd, Huan Huo, Guandong Xu

Keyword spotting -- Detecting commands in speech using deep learning

Dec 09, 2023
Sumedha Rai, Tong Li, Bella Lyu
