Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, then transcribing those words accurately as text.

Evaluating Kubernetes Performance for GenAI Inference: From Automatic Speech Recognition to LLM Summarization

Feb 03, 2026
Via arXiv

Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition

Feb 02, 2026
Via arXiv

BBPE16: UTF-16-based byte-level byte-pair encoding for improved multilingual speech recognition

Feb 02, 2026
Via arXiv

Multilingual Extraction and Recognition of Implicit Discourse Relations in Speech and Text

Feb 04, 2026
Via arXiv

Attention-weighted Centered Kernel Alignment for Knowledge Distillation in Large Audio-Language Models Applied to Speech Emotion Recognition

Feb 02, 2026
Via arXiv

Mići Princ -- A Little Boy Teaching Speech Technologies the Chakavian Dialect

Feb 03, 2026
Via arXiv

Adapting Where It Matters: Depth-Aware Adaptation for Efficient Multilingual Speech Recognition in Low-Resource Languages

Feb 01, 2026
Via arXiv

EmoAra: Emotion-Preserving English Speech Transcription and Cross-Lingual Translation with Arabic Text-to-Speech

Feb 01, 2026
Via arXiv

WAXAL: A Large-Scale Multilingual African Language Speech Corpus

Feb 02, 2026
Via arXiv

Benchmarking Automatic Speech Recognition for Indian Languages in Agricultural Contexts

Jan 31, 2026
Via arXiv