speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Can Large Language Models Reliably Correct Errors in Low-Resource ASR? A Contamination-Aware Case Study on West Frisian

Add code
May 19, 2026
Viaarxiv icon

Evaluation of Conversational Agents: Understanding Culture, Context and Environment in Emotion Detection

Add code
May 28, 2026
Viaarxiv icon

Contextual Biasing for Streaming ASR via CTC-based Word Spotting

Add code
May 19, 2026
Viaarxiv icon

FormalASR: End-to-End Spoken Chinese to Formal Text

Add code
May 19, 2026
Viaarxiv icon

Too Good to Be True: A Study on Modern Automatic Speech Recognition for the Evaluation of Speech Enhancement

Add code
May 12, 2026
Viaarxiv icon

Text Analytics Evaluation Framework: A Case Study on LLMs and Social Media

Add code
May 20, 2026
Viaarxiv icon

Word-Level Modeling with Alignment-Aware Acoustic Fusion for Text-Assisted Intelligibility Prediction in Listeners with Hearing Loss

Add code
May 22, 2026
Viaarxiv icon

Vividh-ASR: A Complexity-Tiered Benchmark and Optimization Dynamics for Robust Indic Speech Recognition

Add code
May 13, 2026
Viaarxiv icon

Cross-Talk Speech Reduction, by Separation, for Separation

Add code
May 19, 2026
Viaarxiv icon

Streaming Speech-to-Text Translation with a SpeechLLM

Add code
May 14, 2026
Viaarxiv icon