speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration

Add code
Jun 09, 2025
Viaarxiv icon

Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms

Add code
Jun 12, 2025
Viaarxiv icon

Unified Semi-Supervised Pipeline for Automatic Speech Recognition

Add code
Jun 09, 2025
Viaarxiv icon

Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia

Add code
Jun 10, 2025
Viaarxiv icon

Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition

Add code
Jun 09, 2025
Viaarxiv icon

A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments

Add code
Jun 17, 2025
Viaarxiv icon

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Add code
Jun 11, 2025
Viaarxiv icon

Speech Recognition on TV Series with Video-guided Post-Correction

Add code
Jun 08, 2025
Viaarxiv icon

Technical Report: A Practical Guide to Kaldi ASR Optimization

Add code
Jun 08, 2025
Viaarxiv icon

Automatic Speech Recognition of African American English: Lexical and Contextual Effects

Add code
Jun 07, 2025
Viaarxiv icon