speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios

Add code
Jun 17, 2025
Viaarxiv icon

PERTINENCE: Input-based Opportunistic Neural Network Dynamic Execution

Add code
Jul 02, 2025
Viaarxiv icon

AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation

Add code
Jul 16, 2025
Viaarxiv icon

Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration

Add code
Jun 09, 2025
Viaarxiv icon

An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW

Add code
Jun 18, 2025
Viaarxiv icon

MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Add code
Jul 01, 2025
Figure 1 for MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement
Figure 2 for MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement
Figure 3 for MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement
Figure 4 for MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement
Viaarxiv icon

Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding

Add code
Jun 13, 2025
Viaarxiv icon

Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform

Add code
Jun 13, 2025
Viaarxiv icon

Regularizing Learnable Feature Extraction for Automatic Speech Recognition

Add code
Jun 11, 2025
Viaarxiv icon

SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition

Add code
Jun 15, 2025
Viaarxiv icon