speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children's Speech?

Add code
Aug 28, 2025
Viaarxiv icon

Talking to Robots: A Practical Examination of Speech Foundation Models for HRI Applications

Add code
Aug 25, 2025
Figure 1 for Talking to Robots: A Practical Examination of Speech Foundation Models for HRI Applications
Viaarxiv icon

Attention2Probability: Attention-Driven Terminology Probability Estimation for Robust Speech-to-Text System

Add code
Aug 26, 2025
Viaarxiv icon

Zero-shot Context Biasing with Trie-based Decoding using Synthetic Multi-Pronunciation

Add code
Aug 25, 2025
Viaarxiv icon

HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization

Add code
Aug 17, 2025
Viaarxiv icon

Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study

Add code
Aug 25, 2025
Viaarxiv icon

UniCoM: A Universal Code-Switching Speech Generator

Add code
Aug 21, 2025
Viaarxiv icon

Joint decoding method for controllable contextual speech recognition based on Speech LLM

Add code
Aug 12, 2025
Figure 1 for Joint decoding method for controllable contextual speech recognition based on Speech LLM
Figure 2 for Joint decoding method for controllable contextual speech recognition based on Speech LLM
Figure 3 for Joint decoding method for controllable contextual speech recognition based on Speech LLM
Figure 4 for Joint decoding method for controllable contextual speech recognition based on Speech LLM
Viaarxiv icon

Continuous Saudi Sign Language Recognition: A Vision Transformer Approach

Add code
Sep 03, 2025
Viaarxiv icon

EmoTale: An Enacted Speech-emotion Dataset in Danish

Add code
Aug 20, 2025
Viaarxiv icon