Speech Recognition


Speech recognition is the task of identifying spoken words by analyzing the speaker's voice and language, and transcribing those words accurately as text.

Plug-in Losses for Evidential Deep Learning: A Simplified Framework for Uncertainty Estimation that Includes the Softmax Classifier

May 21, 2026
Via arXiv

Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

May 21, 2026
Via arXiv

Benchmarking Commercial ASR Systems on Code-Switching Speech: Arabic, Persian, and German

May 21, 2026
Via arXiv

SCRIBE: Diagnostic Evaluation and Rich Transcription Models for Indic ASR

May 20, 2026
Via arXiv

Evaluating Speech Articulation Synthesis with Articulatory Phoneme Recognition

May 20, 2026
Via arXiv

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

May 19, 2026
Via arXiv

Text Analytics Evaluation Framework: A Case Study on LLMs and Social Media

May 20, 2026
Via arXiv

Can Large Language Models Reliably Correct Errors in Low-Resource ASR? A Contamination-Aware Case Study on West Frisian

May 19, 2026
Via arXiv

Contextual Biasing for Streaming ASR via CTC-based Word Spotting

May 19, 2026
Via arXiv

FormalASR: End-to-End Spoken Chinese to Formal Text

May 19, 2026
Via arXiv