speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Viaarxiv icon

AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer

Add code
Jul 23, 2025
Viaarxiv icon

BoSS: Beyond-Semantic Speech

Add code
Jul 23, 2025
Viaarxiv icon

Natural Language Processing for Tigrinya: Current State and Future Directions

Add code
Jul 23, 2025
Viaarxiv icon

Application of Whisper in Clinical Practice: the Post-Stroke Speech Assessment during a Naming Task

Add code
Jul 23, 2025
Viaarxiv icon

Automatically assessing oral narratives of Afrikaans and isiXhosa children

Add code
Jul 18, 2025
Viaarxiv icon

Code-Switching in End-to-End Automatic Speech Recognition: A Systematic Literature Review

Add code
Jul 10, 2025
Viaarxiv icon

NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech

Add code
Jul 17, 2025
Viaarxiv icon

Reading Between the Lines: Combining Pause Dynamics and Semantic Coherence for Automated Assessment of Thought Disorder

Add code
Jul 17, 2025
Viaarxiv icon

Improving Contextual ASR via Multi-grained Fusion with Large Language Models

Add code
Jul 16, 2025
Viaarxiv icon