Speech Recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the voice signal and the language, and transcribing them accurately as text.

Speech Tokenizer is Key to Consistent Representation

Jul 09, 2025

PERTINENCE: Input-based Opportunistic Neural Network Dynamic Execution

Jul 02, 2025

AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation

Jul 16, 2025

An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW

Jun 18, 2025

Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding

Jun 13, 2025

MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Jul 01, 2025

Regularizing Learnable Feature Extraction for Automatic Speech Recognition

Jun 11, 2025

Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform

Jun 13, 2025

SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition

Jun 15, 2025

FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition

Jun 12, 2025