speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Multilingual Phonological Feature Recognition with Self-Supervised Speech Models

Add code
May 25, 2026
Viaarxiv icon

Hardware-Aware Federated Learning for Speech Emotion Recognition

Add code
May 23, 2026
Viaarxiv icon

Convex Low-resource Accent-Robust Language Detection in Speech Recognition

Add code
May 22, 2026
Viaarxiv icon

Phonetic Modeling of Dialectal Variation in Vietnamese Speech

Add code
May 23, 2026
Viaarxiv icon

AI Security Research Should Better Incentivize Defense Research

Add code
May 22, 2026
Viaarxiv icon

Evaluation of Conversational Agents: Understanding Culture, Context and Environment in Emotion Detection

Add code
May 28, 2026
Viaarxiv icon

StepAudio 2.5 Technical Report

Add code
May 22, 2026
Viaarxiv icon

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Add code
May 19, 2026
Viaarxiv icon

Plug-in Losses for Evidential Deep Learning: A Simplified Framework for Uncertainty Estimation that Includes the Softmax Classifier

Add code
May 21, 2026
Viaarxiv icon

Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

Add code
May 21, 2026
Viaarxiv icon