speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Cross-Modal Bottleneck Fusion For Noise Robust Audio-Visual Speech Recognition

Add code
Feb 09, 2026
Viaarxiv icon

"Sorry, I Didn't Catch That": How Speech Models Miss What Matters Most

Add code
Feb 16, 2026
Viaarxiv icon

CLAP-Based Automatic Word Naming Recognition in Post-Stroke Aphasia

Add code
Feb 16, 2026
Viaarxiv icon

Where Are We At with Automatic Speech Recognition for the Bambara Language?

Add code
Feb 10, 2026
Viaarxiv icon

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Add code
Feb 15, 2026
Viaarxiv icon

From Scarcity to Scale: A Release-Level Analysis of the Pashto Common Voice Dataset

Add code
Feb 15, 2026
Viaarxiv icon

voice2mode: Phonation Mode Classification in Singing using Self-Supervised Speech Models

Add code
Feb 14, 2026
Viaarxiv icon

Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

Add code
Feb 13, 2026
Viaarxiv icon

Investigation for Relative Voice Impression Estimation

Add code
Feb 15, 2026
Viaarxiv icon

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

Add code
Feb 13, 2026
Viaarxiv icon