speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Navigating the Reality Gap: Privacy-Preserving Adaptation of ASR for Challenging Low-Resource Domains

Add code
Dec 22, 2025
Figure 1 for Navigating the Reality Gap: Privacy-Preserving Adaptation of ASR for Challenging Low-Resource Domains
Figure 2 for Navigating the Reality Gap: Privacy-Preserving Adaptation of ASR for Challenging Low-Resource Domains
Figure 3 for Navigating the Reality Gap: Privacy-Preserving Adaptation of ASR for Challenging Low-Resource Domains
Figure 4 for Navigating the Reality Gap: Privacy-Preserving Adaptation of ASR for Challenging Low-Resource Domains
Viaarxiv icon

Peeking Into The Future For Contextual Biasing

Add code
Dec 19, 2025
Figure 1 for Peeking Into The Future For Contextual Biasing
Figure 2 for Peeking Into The Future For Contextual Biasing
Figure 3 for Peeking Into The Future For Contextual Biasing
Figure 4 for Peeking Into The Future For Contextual Biasing
Viaarxiv icon

Reproducing and Dissecting Denoising Language Models for Speech Recognition

Add code
Dec 15, 2025
Viaarxiv icon

Semantic Codebooks as Effective Priors for Neural Speech Compression

Add code
Dec 25, 2025
Figure 1 for Semantic Codebooks as Effective Priors for Neural Speech Compression
Figure 2 for Semantic Codebooks as Effective Priors for Neural Speech Compression
Figure 3 for Semantic Codebooks as Effective Priors for Neural Speech Compression
Figure 4 for Semantic Codebooks as Effective Priors for Neural Speech Compression
Viaarxiv icon

When De-noising Hurts: A Systematic Study of Speech Enhancement Effects on Modern Medical ASR Systems

Add code
Dec 19, 2025
Viaarxiv icon

Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models

Add code
Dec 18, 2025
Figure 1 for Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models
Figure 2 for Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models
Figure 3 for Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models
Figure 4 for Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models
Viaarxiv icon

All-in-One ASR: Unifying Encoder-Decoder Models of CTC, Attention, and Transducer in Dual-Mode ASR

Add code
Dec 12, 2025
Viaarxiv icon

TRIDENT: A Redundant Architecture for Caribbean-Accented Emergency Speech Triage

Add code
Dec 11, 2025
Viaarxiv icon

EEG-to-Voice Decoding of Spoken and Imagined speech Using Non-Invasive EEG

Add code
Dec 14, 2025
Viaarxiv icon

A stylometric analysis of speaker attribution from speech transcripts

Add code
Dec 18, 2025
Viaarxiv icon