Speech Recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately as text.

Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis

May 28, 2025

Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation

May 27, 2025

Can Emotion Fool Anti-spoofing?

May 29, 2025

GMU Systems for the IWSLT 2025 Low-Resource Speech Translation Shared Task

May 27, 2025

Fine-Tuning Video Transformers for Word-Level Bangla Sign Language: A Comparative Analysis for Classification Tasks

Jun 04, 2025

ZIPA: A family of efficient models for multilingual phone recognition

May 29, 2025

Towards One-bit ASR: Extremely Low-bit Conformer Quantization Using Co-training and Stochastic Precision

May 27, 2025

Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence

May 26, 2025

Developing a Top-tier Framework in Naturalistic Conditions Challenge for Categorized Emotion Prediction: From Speech Foundation Models and Learning Objective to Data Augmentation and Engineering Choices

May 28, 2025

Swedish Whispers; Leveraging a Massive Speech Corpus for Swedish Speech Recognition

May 23, 2025