Speech Recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the voice signal and the language, and transcribing those words accurately as text.

Improving End-to-End Speech Recognition for Dysarthric Speech through In-Domain Data Augmentation

Jun 18, 2026
Via arXiv

Systematic Study of Dysarthric Speech Recognition: Spectral Features and Acoustic Models

Jun 18, 2026
Via arXiv

Low-Burden Data Augmentation for Dysarthric ASR via Zero-Shot Voice Cloning

Jun 18, 2026
Via arXiv

ReNikud: Audio-Supervised Hebrew Grapheme-to-Phoneme Conversion

Jun 18, 2026
Via arXiv

Cross-Dataset, Age, and Gender Generalization: A Comprehensive Analysis of Fine-Tuning Strategies for Low-Resource Children's ASR

Jun 18, 2026
Via arXiv

A Comparative Study of Pretrained Transformer Models for Quranic ASR: Speech Representations, Label Formats, and Dataset Composition

Jun 18, 2026
Via arXiv

DASH: Dual-View Self-Distillation with Multi-Layer Hidden Representations for Robust Speech Recognition

Jun 17, 2026
Via arXiv

IndicContextEval: A Benchmark for Evaluating Context Utilisation in Audio Large Language Models Across 8 Indic Languages

Jun 17, 2026
Via arXiv

Speech-Driven End-to-End Language Discrimination towards Chinese Dialects

Jun 17, 2026
Via arXiv

Low-resource Language Discrimination Towards Chinese Dialects with Transfer learning and Data Augmentation

Jun 17, 2026
Via arXiv