speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation

Add code
May 27, 2025
Viaarxiv icon

GMU Systems for the IWSLT 2025 Low-Resource Speech Translation Shared Task

Add code
May 27, 2025
Viaarxiv icon

Swedish Whispers; Leveraging a Massive Speech Corpus for Swedish Speech Recognition

Add code
May 23, 2025
Viaarxiv icon

Can Emotion Fool Anti-spoofing?

Add code
May 29, 2025
Viaarxiv icon

Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence

Add code
May 26, 2025
Viaarxiv icon

Towards One-bit ASR: Extremely Low-bit Conformer Quantization Using Co-training and Stochastic Precision

Add code
May 27, 2025
Viaarxiv icon

Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy

Add code
May 26, 2025
Viaarxiv icon

ZIPA: A family of efficient models for multilingual phone recognition

Add code
May 29, 2025
Viaarxiv icon

Developing a Top-tier Framework in Naturalistic Conditions Challenge for Categorized Emotion Prediction: From Speech Foundation Models and Learning Objective to Data Augmentation and Engineering Choices

Add code
May 28, 2025
Viaarxiv icon

An Effective Training Framework for Light-Weight Automatic Speech Recognition Models

Add code
May 22, 2025
Viaarxiv icon