Alert button

"speech recognition": models, code, and papers
Alert button

Accented Speech Recognition With Accent-specific Codebooks

Add code
Bookmark button
Alert button
Oct 25, 2023
Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni

Figure 1 for Accented Speech Recognition With Accent-specific Codebooks
Figure 2 for Accented Speech Recognition With Accent-specific Codebooks
Figure 3 for Accented Speech Recognition With Accent-specific Codebooks
Figure 4 for Accented Speech Recognition With Accent-specific Codebooks
Viaarxiv icon

GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition

Add code
Bookmark button
Alert button
Nov 08, 2023
Daniel Galvez, Tim Kaldewey

Figure 1 for GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition
Figure 2 for GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition
Figure 3 for GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition
Figure 4 for GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition
Viaarxiv icon

Multi-Level Knowledge Distillation for Speech Emotion Recognition in Noisy Conditions

Dec 21, 2023
Yang Liu, Haoqin Sun, Geng Chen, Qingyue Wang, Zhen Zhao, Xugang Lu, Longbiao Wang

Viaarxiv icon

Exploratory Evaluation of Speech Content Masking

Jan 08, 2024
Jennifer Williams, Karla Pizzi, Paul-Gauthier Noe, Sneha Das

Viaarxiv icon

Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models

Add code
Bookmark button
Alert button
Jan 15, 2024
Dan Jacobellis, Daniel Cummings, Neeraja J. Yadwadkar

Viaarxiv icon

Named Entity Recognition for Address Extraction in Speech-to-Text Transcriptions Using Synthetic Data

Feb 08, 2024
Bibiána Lajčinová, Patrik Valábek, Michal Spišiak

Viaarxiv icon

Part-of-Speech Tagger for Bodo Language using Deep Learning approach

Jan 06, 2024
Dhrubajyoti Pathak, Sanjib Narzary, Sukumar Nandi, Bidisha Som

Viaarxiv icon

Intelligibility prediction with a pretrained noise-robust automatic speech recognition model

Add code
Bookmark button
Alert button
Oct 20, 2023
Zehai Tu, Ning Ma, Jon Barker

Viaarxiv icon

Ms-senet: Enhancing Speech Emotion Recognition Through Multi-scale Feature Fusion With Squeeze-and-excitation Blocks

Dec 25, 2023
Mengbo Li, Yuanzhong Zheng, Dichucheng Li, Yulun Wu, Yaoxuan Wang, Haojun Fei

Viaarxiv icon

Self-Supervised Adaptive AV Fusion Module for Pre-Trained ASR Models

Dec 21, 2023
Christopher Simic, Tobias Bocklet

Viaarxiv icon