Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately.

Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios

Jun 17, 2025

MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Jul 01, 2025

An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW

Jun 18, 2025

Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration

Jun 09, 2025

Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding

Jun 13, 2025

Accurate, fast, cheap: Choose three. Replacing Multi-Head-Attention with Bidirectional Recurrent Attention for Long-Form ASR

Jun 24, 2025

Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform

Jun 13, 2025

Regularizing Learnable Feature Extraction for Automatic Speech Recognition

Jun 11, 2025

SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition

Jun 15, 2025

FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition

Jun 12, 2025