speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Add code
Jun 11, 2025
Viaarxiv icon

Hybrid Deep Learning and Signal Processing for Arabic Dialect Recognition in Low-Resource Settings

Add code
Jun 26, 2025
Viaarxiv icon

Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems

Add code
Jun 06, 2025
Viaarxiv icon

Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction

Add code
Jun 06, 2025
Viaarxiv icon

A Survey of Retentive Network

Add code
Jun 07, 2025
Viaarxiv icon

Rhythm Features for Speaker Identification

Add code
Jun 07, 2025
Viaarxiv icon

Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation

Add code
Jul 09, 2025
Viaarxiv icon

A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments

Add code
Jun 17, 2025
Viaarxiv icon

From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars

Add code
Jun 16, 2025
Figure 1 for From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars
Figure 2 for From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars
Figure 3 for From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars
Figure 4 for From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars
Viaarxiv icon

Robust Unsupervised Adaptation of a Speech Recogniser Using Entropy Minimisation and Speaker Codes

Add code
Jun 12, 2025
Viaarxiv icon