speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Edge-ASR: Towards Low-Bit Quantization of Automatic Speech Recognition Models

Add code
Jul 10, 2025
Viaarxiv icon

Audio-Vision Contrastive Learning for Phonological Class Recognition

Add code
Jul 23, 2025
Viaarxiv icon

VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis

Add code
Jul 08, 2025
Viaarxiv icon

StylOch at PAN: Gradient-Boosted Trees with Frequency-Based Stylometric Features

Add code
Jul 16, 2025
Viaarxiv icon

End-to-end Acoustic-linguistic Emotion and Intent Recognition Enhanced by Semi-supervised Learning

Add code
Jul 10, 2025
Viaarxiv icon

AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation

Add code
Jul 16, 2025
Viaarxiv icon

A Novel Hybrid Deep Learning Technique for Speech Emotion Detection using Feature Engineering

Add code
Jul 09, 2025
Viaarxiv icon

Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla

Add code
Jul 02, 2025
Viaarxiv icon

A Cookbook for Community-driven Data Collection of Impaired Speech in LowResource Languages

Add code
Jul 03, 2025
Viaarxiv icon

AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance

Add code
Jul 02, 2025
Viaarxiv icon