Speech recognition


Speech recognition is the task of identifying spoken words by analyzing the speaker's voice and language, and transcribing them accurately as text.

Speech-Driven End-to-End Language Discrimination towards Chinese Dialects

Jun 17, 2026

Low-resource Language Discrimination Towards Chinese Dialects with Transfer learning and Data Augmentation

Jun 17, 2026

An Analysis of the Effectiveness of Synthetic Speech Data for ASR Fine-tuning in Selected Indic Languages

Jun 16, 2026

When Multiple Scripts Matter: Evaluating ASR in Clinical Settings

Jun 16, 2026

Improving low-resource ASR using bilingual fine-tuning with language identification: a cross-linguistic evaluation

Jun 16, 2026

Decoding while Adapting: Zero-Shot Online Speaker Adaptation via Audio-Textual Prompts for Elderly Speech Recognition

Jun 15, 2026

Confidence Score Guided Incremental and Speaker Adaptive Pseudo-Labeling for Semi-Supervised Elderly Speech Recognition

Jun 15, 2026

ASTRA: A Scalable Next-Generation ATCO Training Simulator with Autonomous Simpilots

Jun 16, 2026

Are you speaking my languages? On spoken language adherence in multimodal LLMs

Jun 15, 2026

Intelligibility of Speech in Noise: Investigating Contribution of Magnitude and Phase Spectra

Jun 15, 2026