Speech Recognition


Speech recognition is the task of identifying spoken words by analyzing the speaker's voice and language, and transcribing them accurately as text.

OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary

Jun 11, 2025
Via arXiv

SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research

Jun 10, 2025
Via arXiv

Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems

Jun 16, 2025
Via arXiv

Unified Semi-Supervised Pipeline for Automatic Speech Recognition

Jun 09, 2025
Via arXiv

Multi-Teacher Language-Aware Knowledge Distillation for Multilingual Speech Emotion Recognition

Jun 10, 2025
Via arXiv

Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition

Jun 09, 2025
Via arXiv

AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition

Jun 06, 2025
Via arXiv

Developing a High-performance Framework for Speech Emotion Recognition in Naturalistic Conditions Challenge for Emotional Attribute Prediction

Jun 12, 2025
Via arXiv

NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025

Jun 16, 2025
Via arXiv

Seewo's Submission to MLC-SLM: Lessons Learned from Speech Reasoning Language Models

Jun 16, 2025
Via arXiv