speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation

Add code
Jul 02, 2025
Viaarxiv icon

PERTINENCE: Input-based Opportunistic Neural Network Dynamic Execution

Add code
Jul 02, 2025
Viaarxiv icon

MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Add code
Jul 01, 2025
Viaarxiv icon

Accurate, fast, cheap: Choose three. Replacing Multi-Head-Attention with Bidirectional Recurrent Attention for Long-Form ASR

Add code
Jun 24, 2025
Viaarxiv icon

Hybrid Deep Learning and Signal Processing for Arabic Dialect Recognition in Low-Resource Settings

Add code
Jun 26, 2025
Viaarxiv icon

Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition

Add code
Jun 17, 2025
Viaarxiv icon

An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW

Add code
Jun 18, 2025
Viaarxiv icon

Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios

Add code
Jun 17, 2025
Viaarxiv icon

Exploiting Music Source Separation for Automatic Lyrics Transcription with Whisper

Add code
Jun 18, 2025
Viaarxiv icon

Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems

Add code
Jun 16, 2025
Viaarxiv icon