Speech Recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

TinyML for Speech Recognition

Add code
Apr 22, 2025
Viaarxiv icon

PsyCounAssist: A Full-Cycle AI-Powered Psychological Counseling Assistant System

Add code
Apr 23, 2025
Viaarxiv icon

Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides

Add code
Apr 21, 2025
Viaarxiv icon

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Add code
Apr 22, 2025
Viaarxiv icon

StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models

Add code
Apr 21, 2025
Viaarxiv icon

SoCov: Semi-Orthogonal Parametric Pooling of Covariance Matrix for Speaker Recognition

Add code
Apr 23, 2025
Viaarxiv icon

Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning

Add code
Apr 16, 2025
Viaarxiv icon

Acoustic to Articulatory Inversion of Speech; Data Driven Approaches, Challenges, Applications, and Future Scope

Add code
Apr 17, 2025
Viaarxiv icon

Dysarthria Normalization via Local Lie Group Transformations for Robust ASR

Add code
Apr 16, 2025
Viaarxiv icon

Real-Time Word-Level Temporal Segmentation in Streaming Speech Recognition

Add code
Apr 15, 2025
Viaarxiv icon