Speech Recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, then transcribing those words accurately as text.

MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition

May 30, 2025

Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning

Jun 06, 2025

Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems

Jun 06, 2025

LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models

Jun 05, 2025

A Survey of Retentive Network

Jun 07, 2025

On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition

May 28, 2025

Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction

Jun 06, 2025

Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation

Jul 09, 2025

Evaluation of LLMs in Speech is Often Flawed: Test Set Contamination in Large Language Models for Speech Recognition

May 28, 2025

Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR

Jun 04, 2025