speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

SecureSpeech: Prompt-based Speaker and Content Protection

Add code
Jul 10, 2025
Viaarxiv icon

Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios

Add code
Jun 17, 2025
Viaarxiv icon

Accurate, fast, cheap: Choose three. Replacing Multi-Head-Attention with Bidirectional Recurrent Attention for Long-Form ASR

Add code
Jun 24, 2025
Viaarxiv icon

An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW

Add code
Jun 18, 2025
Viaarxiv icon

Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding

Add code
Jun 13, 2025
Viaarxiv icon

SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition

Add code
Jun 15, 2025
Viaarxiv icon

Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform

Add code
Jun 13, 2025
Viaarxiv icon

Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration

Add code
Jun 09, 2025
Viaarxiv icon

Regularizing Learnable Feature Extraction for Automatic Speech Recognition

Add code
Jun 11, 2025
Viaarxiv icon

FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition

Add code
Jun 12, 2025
Viaarxiv icon