speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

Add code
Feb 17, 2026
Viaarxiv icon

Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios

Add code
Feb 17, 2026
Viaarxiv icon

Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics

Add code
Feb 25, 2026
Viaarxiv icon

"Sorry, I Didn't Catch That": How Speech Models Miss What Matters Most

Add code
Feb 16, 2026
Viaarxiv icon

Assessing the Impact of Speaker Identity in Speech Spoofing Detection

Add code
Feb 24, 2026
Viaarxiv icon

CLAP-Based Automatic Word Naming Recognition in Post-Stroke Aphasia

Add code
Feb 16, 2026
Viaarxiv icon

Speech to Speech Synthesis for Voice Impersonation

Add code
Feb 13, 2026
Viaarxiv icon

ViSpeechFormer: A Phonemic Approach for Vietnamese Automatic Speech Recognition

Add code
Feb 10, 2026
Viaarxiv icon

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Add code
Feb 15, 2026
Viaarxiv icon

From Scarcity to Scale: A Release-Level Analysis of the Pashto Common Voice Dataset

Add code
Feb 15, 2026
Viaarxiv icon