Speech Recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately as text.

CLAP-Based Automatic Word Naming Recognition in Post-Stroke Aphasia

Feb 16, 2026
Via arXiv

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Feb 15, 2026
Via arXiv

From Scarcity to Scale: A Release-Level Analysis of the Pashto Common Voice Dataset

Feb 15, 2026
Via arXiv

Investigation for Relative Voice Impression Estimation

Feb 15, 2026
Via arXiv

Speech to Speech Synthesis for Voice Impersonation

Feb 13, 2026
Via arXiv

voice2mode: Phonation Mode Classification in Singing using Self-Supervised Speech Models

Feb 14, 2026
Via arXiv

Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

Feb 13, 2026
Via arXiv

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

Feb 13, 2026
Via arXiv

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

Feb 13, 2026
Via arXiv

PISHYAR: A Socially Intelligent Smart Cane for Indoor Social Navigation and Multimodal Human-Robot Interaction for Visually Impaired People

Feb 13, 2026
Via arXiv