speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Towards Robust Dysarthric Speech Recognition: LLM-Agent Post-ASR Correction Beyond WER

Add code
Jan 29, 2026
Viaarxiv icon

VowelPrompt: Hearing Speech Emotions from Text via Vowel-level Prosodic Augmentation

Add code
Feb 06, 2026
Viaarxiv icon

MedSpeak: A Knowledge Graph-Aided ASR Error Correction Framework for Spoken Medical QA

Add code
Feb 01, 2026
Viaarxiv icon

Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection

Add code
Jan 28, 2026
Viaarxiv icon

Mići Princ -- A Little Boy Teaching Speech Technologies the Chakavian Dialect

Add code
Feb 03, 2026
Viaarxiv icon

WAXAL: A Large-Scale Multilingual African Language Speech Corpus

Add code
Feb 02, 2026
Viaarxiv icon

Do we really need Self-Attention for Streaming Automatic Speech Recognition?

Add code
Jan 27, 2026
Viaarxiv icon

SW-ASR: A Context-Aware Hybrid ASR Pipeline for Robust Single Word Speech Recognition

Add code
Jan 28, 2026
Viaarxiv icon

Multilingual Extraction and Recognition of Implicit Discourse Relations in Speech and Text

Add code
Feb 04, 2026
Viaarxiv icon

Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition

Add code
Jan 27, 2026
Viaarxiv icon