Speech Recognition


Speech recognition is the task of identifying the words in spoken audio by analyzing the speaker's voice and language, and transcribing those words accurately as text.

Towards Personalized Federated Learning for Dysarthric Speech Recognition

Jun 11, 2026
Via arXiv

Balancing ASR and diarization in end-to-end LLMs for multi-talker speech recognition

Jun 11, 2026
Via arXiv

Positional Encoding in the Context of Memristor-Based Analog Computation for Automatic Speech Recognition

Jun 11, 2026
Via arXiv

PiDA: Phonetically-Informed Data Augmentation for Robust Vietnamese Speech Translation

Jun 11, 2026
Via arXiv

Ontology Memory-Augmented ASR Correction for Long Text-Speech Interleaved Conversations

Jun 11, 2026
Via arXiv

Evaluating Bias in Phoneme-Based Automatic Speech Recognition Systems: An Analysis of IPA Transcription Models

Jun 10, 2026
Via arXiv

Tight Boundary Prediction in Speaker Diarization Using Causal-Anticausal Consistency

Jun 10, 2026
Via arXiv

Pretrained self-supervised speech models can recognize unseen consonants

Jun 10, 2026
Via arXiv

Speech Encoder Fusion for LLM-based Automatic Speech Recognition

Jun 09, 2026
Via arXiv

Phoneme-First Prediction for LLM-Based Speech Recognition

Jun 09, 2026
Via arXiv