speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

CAMÕES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese

Add code
Aug 27, 2025
Viaarxiv icon

Continuous Saudi Sign Language Recognition: A Vision Transformer Approach

Add code
Sep 03, 2025
Viaarxiv icon

Cross-Learning Fine-Tuning Strategy for Dysarthric Speech Recognition Via CDSD database

Add code
Aug 26, 2025
Viaarxiv icon

NSPDI-SNN: An efficient lightweight SNN based on nonlinear synaptic pruning and dendritic integration

Add code
Aug 29, 2025
Viaarxiv icon

Improving Noise Robust Audio-Visual Speech Recognition via Router-Gated Cross-Modal Feature Fusion

Add code
Aug 26, 2025
Viaarxiv icon

Generative Annotation for ASR Named Entity Correction

Add code
Aug 28, 2025
Viaarxiv icon

Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children's Speech?

Add code
Aug 28, 2025
Viaarxiv icon

Hybrid Decoding: Rapid Pass and Selective Detailed Correction for Sequence Models

Add code
Aug 27, 2025
Viaarxiv icon

Designing Practical Models for Isolated Word Visual Speech Recognition

Add code
Aug 25, 2025
Viaarxiv icon

Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech

Add code
Aug 25, 2025
Viaarxiv icon