speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

A Study on Regularization-Based Continual Learning Methods for Indic ASR

Add code
Aug 08, 2025
Viaarxiv icon

Speech LLMs in Low-Resource Scenarios: Data Volume Requirements and the Impact of Pretraining on High-Resource Languages

Add code
Aug 07, 2025
Viaarxiv icon

Rethinking Tokenization for Rich Morphology: The Dominance of Unigram over BPE and Morphological Alignment

Add code
Aug 11, 2025
Viaarxiv icon

Efficient Scaling for LLM-based ASR

Add code
Aug 06, 2025
Viaarxiv icon

NVSpeech: An Integrated and Scalable Pipeline for Human-Like Speech Modeling with Paralinguistic Vocalizations

Add code
Aug 06, 2025
Viaarxiv icon

MiDashengLM: Efficient Audio Understanding with General Audio Captions

Add code
Aug 06, 2025
Viaarxiv icon

LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness

Add code
Aug 05, 2025
Viaarxiv icon

Multi-Target Backdoor Attacks Against Speaker Recognition

Add code
Aug 13, 2025
Viaarxiv icon

Identifying Hearing Difficulty Moments in Conversational Audio

Add code
Jul 31, 2025
Viaarxiv icon

The Interspeech 2025 Speech Accessibility Project Challenge

Add code
Jul 29, 2025
Viaarxiv icon