speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Automatic Speech Recognition of African American English: Lexical and Contextual Effects

Add code
Jun 07, 2025
Viaarxiv icon

MEDUSA: A Multimodal Deep Fusion Multi-Stage Training Framework for Speech Emotion Recognition in Naturalistic Conditions

Add code
Jun 11, 2025
Viaarxiv icon

NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025

Add code
Jun 16, 2025
Viaarxiv icon

Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia

Add code
Jun 10, 2025
Viaarxiv icon

Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models

Add code
Jun 16, 2025
Viaarxiv icon

Speech Recognition on TV Series with Video-guided Post-Correction

Add code
Jun 08, 2025
Viaarxiv icon

Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR

Add code
Jun 16, 2025
Viaarxiv icon

Technical Report: A Practical Guide to Kaldi ASR Optimization

Add code
Jun 08, 2025
Viaarxiv icon

Unifying Streaming and Non-streaming Zipformer-based ASR

Add code
Jun 17, 2025
Figure 1 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 2 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 3 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 4 for Unifying Streaming and Non-streaming Zipformer-based ASR
Viaarxiv icon

BUT System for the MLC-SLM Challenge

Add code
Jun 16, 2025
Viaarxiv icon