Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the voice and language, then accurately transcribing those words as text.

CLAP-Based Automatic Word Naming Recognition in Post-Stroke Aphasia

Feb 16, 2026
Via arXiv

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Feb 15, 2026
Via arXiv

From Scarcity to Scale: A Release-Level Analysis of the Pashto Common Voice Dataset

Feb 15, 2026
Via arXiv

Investigation for Relative Voice Impression Estimation

Feb 15, 2026
Via arXiv

voice2mode: Phonation Mode Classification in Singing using Self-Supervised Speech Models

Feb 14, 2026
Via arXiv

Speech to Speech Synthesis for Voice Impersonation

Feb 13, 2026
Via arXiv

Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

Feb 13, 2026
Via arXiv

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

Feb 13, 2026
Via arXiv

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

Feb 13, 2026
Via arXiv

PISHYAR: A Socially Intelligent Smart Cane for Indoor Social Navigation and Multimodal Human-Robot Interaction for Visually Impaired People

Feb 13, 2026
Via arXiv