speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Accurate, fast, cheap: Choose three. Replacing Multi-Head-Attention with Bidirectional Recurrent Attention for Long-Form ASR

Add code
Jun 24, 2025
Viaarxiv icon

An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW

Add code
Jun 18, 2025
Viaarxiv icon

Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding

Add code
Jun 13, 2025
Viaarxiv icon

SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition

Add code
Jun 15, 2025
Viaarxiv icon

Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform

Add code
Jun 13, 2025
Viaarxiv icon

Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration

Add code
Jun 09, 2025
Viaarxiv icon

Regularizing Learnable Feature Extraction for Automatic Speech Recognition

Add code
Jun 11, 2025
Viaarxiv icon

FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition

Add code
Jun 12, 2025
Viaarxiv icon

OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary

Add code
Jun 11, 2025
Viaarxiv icon

SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research

Add code
Jun 10, 2025
Viaarxiv icon