speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

BabAR: from phoneme recognition to developmental measures of young children's speech production

Add code
Mar 05, 2026
Viaarxiv icon

Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards

Add code
Mar 05, 2026
Viaarxiv icon

An Investigation Into Various Approaches For Bengali Long-Form Speech Transcription and Bengali Speaker Diarization

Add code
Mar 03, 2026
Viaarxiv icon

Towards Orthographically-Informed Evaluation of Speech Recognition Systems for Indian Languages

Add code
Mar 01, 2026
Viaarxiv icon

More Data, Fewer Diacritics: Scaling Arabic TTS

Add code
Mar 02, 2026
Viaarxiv icon

DARS: Dysarthria-Aware Rhythm-Style Synthesis for ASR Enhancement

Add code
Mar 02, 2026
Viaarxiv icon

RO-N3WS: Enhancing Generalization in Low-Resource ASR with Diverse Romanian Speech Benchmarks

Add code
Mar 02, 2026
Viaarxiv icon

GLoRIA: Gated Low-Rank Interpretable Adaptation for Dialectal ASR

Add code
Mar 02, 2026
Viaarxiv icon

ACES: Accent Subspaces for Coupling, Explanations, and Stress-Testing in Automatic Speech Recognition

Add code
Feb 28, 2026
Viaarxiv icon

The USTC-NERCSLIP Systems for the CHiME-9 MCoRec Challenge

Add code
Mar 02, 2026
Viaarxiv icon