Speech Recognition

Speech recognition is the task of identifying spoken words by analyzing the speaker's voice and language, and transcribing those words accurately as text.

LLM-based phoneme-to-grapheme for phoneme-based speech recognition

Jun 05, 2025
Via arXiv

EMO-Debias: Benchmarking Gender Debiasing Techniques in Multi-Label Speech Emotion Recognition

Jun 05, 2025
Via arXiv

CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition

Jun 06, 2025
Via arXiv

A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments

Jun 17, 2025
Via arXiv

Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Jun 07, 2025
Via arXiv

A Survey of Retentive Network

Jun 07, 2025
Via arXiv

Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning

Jun 06, 2025
Via arXiv

Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems

Jun 06, 2025
Via arXiv

Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction

Jun 06, 2025
Via arXiv

LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models

Jun 05, 2025
Via arXiv