speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

A Cookbook for Community-driven Data Collection of Impaired Speech in LowResource Languages

Add code
Jul 03, 2025
Viaarxiv icon

Benchmarking Akan ASR Models Across Domain-Specific Datasets: A Comparative Evaluation of Performance, Scalability, and Adaptability

Add code
Jul 03, 2025
Viaarxiv icon

Open-Source System for Multilingual Translation and Cloned Speech Synthesis

Add code
Jul 03, 2025
Viaarxiv icon

Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla

Add code
Jul 02, 2025
Viaarxiv icon

AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance

Add code
Jul 02, 2025
Viaarxiv icon

First Steps Towards Voice Anonymization for Code-Switching Speech

Add code
Jul 02, 2025
Viaarxiv icon

Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation

Add code
Jul 02, 2025
Viaarxiv icon

PERTINENCE: Input-based Opportunistic Neural Network Dynamic Execution

Add code
Jul 02, 2025
Viaarxiv icon

MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Add code
Jul 01, 2025
Viaarxiv icon

Accurate, fast, cheap: Choose three. Replacing Multi-Head-Attention with Bidirectional Recurrent Attention for Long-Form ASR

Add code
Jun 24, 2025
Viaarxiv icon