speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Developing a High-performance Framework for Speech Emotion Recognition in Naturalistic Conditions Challenge for Emotional Attribute Prediction

Add code
Jun 12, 2025
Viaarxiv icon

OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary

Add code
Jun 11, 2025
Viaarxiv icon

(SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test

Add code
Jun 13, 2025
Viaarxiv icon

SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research

Add code
Jun 10, 2025
Viaarxiv icon

Joint ASR and Speaker Role Tagging with Serialized Output Training

Add code
Jun 12, 2025
Viaarxiv icon

Improving Named Entity Transcription with Contextual LLM-based Revision

Add code
Jun 12, 2025
Viaarxiv icon

MEDUSA: A Multimodal Deep Fusion Multi-Stage Training Framework for Speech Emotion Recognition in Naturalistic Conditions

Add code
Jun 11, 2025
Viaarxiv icon

Enabling automatic transcription of child-centered audio recordings from real-world environments

Add code
Jun 13, 2025
Viaarxiv icon

Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments

Add code
Jun 13, 2025
Viaarxiv icon

Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration

Add code
Jun 09, 2025
Viaarxiv icon