speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

A Novel Hybrid Deep Learning Technique for Speech Emotion Detection using Feature Engineering

Add code
Jul 09, 2025
Viaarxiv icon

Speech Tokenizer is Key to Consistent Representation

Add code
Jul 09, 2025
Viaarxiv icon

VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis

Add code
Jul 08, 2025
Viaarxiv icon

Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation

Add code
Jul 09, 2025
Viaarxiv icon

A Cookbook for Community-driven Data Collection of Impaired Speech in LowResource Languages

Add code
Jul 03, 2025
Viaarxiv icon

Benchmarking Akan ASR Models Across Domain-Specific Datasets: A Comparative Evaluation of Performance, Scalability, and Adaptability

Add code
Jul 03, 2025
Viaarxiv icon

Open-Source System for Multilingual Translation and Cloned Speech Synthesis

Add code
Jul 03, 2025
Viaarxiv icon

Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla

Add code
Jul 02, 2025
Viaarxiv icon

AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance

Add code
Jul 02, 2025
Viaarxiv icon

First Steps Towards Voice Anonymization for Code-Switching Speech

Add code
Jul 02, 2025
Viaarxiv icon