speech recognition


Speech recognition is the task of identifying spoken words by analyzing the speaker's voice and language, and transcribing them accurately as text.

Tagarela - A Portuguese speech dataset from podcasts

Mar 16, 2026

Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition

Mar 12, 2026

Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition

Mar 11, 2026

Huntington Disease Automatic Speech Recognition with Biomarker Supervision

Mar 11, 2026

Learnable Pulse Accumulation for On-Device Speech Recognition: How Much Attention Do You Need?

Mar 11, 2026

Trade-offs Between Capacity and Robustness in Neural Audio Codecs for Adversarially Robust Speech Recognition

Mar 10, 2026

FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System

Mar 11, 2026

A Semi-spontaneous Dutch Speech Dataset for Speech Enhancement and Speech Recognition

Mar 10, 2026

Continued Pretraining for Low-Resource Swahili ASR: Achieving State-of-the-Art Performance with Minimal Labeled Data

Mar 11, 2026

Synthetic Data Domain Adaptation for ASR via LLM-based Text and Phonetic Respelling Augmentation

Mar 11, 2026