speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Sequence-Level Unsupervised Training in Speech Recognition: A Theoretical Study

Add code
Mar 02, 2026
Viaarxiv icon

Team RAS in 10th ABAW Competition: Multimodal Valence and Arousal Estimation Approach

Add code
Mar 13, 2026
Viaarxiv icon

SilentWear: an Ultra-Low Power Wearable System for EMG-based Silent Speech Recognition

Add code
Mar 04, 2026
Viaarxiv icon

Benchmarking Speech Systems for Frontline Health Conversations: The DISPLACE-M Challenge

Add code
Mar 05, 2026
Viaarxiv icon

Using Songs to Improve Kazakh Automatic Speech Recognition

Add code
Mar 03, 2026
Viaarxiv icon

Visual-Informed Speech Enhancement Using Attention-Based Beamforming

Add code
Mar 05, 2026
Viaarxiv icon

When Denoising Hinders: Revisiting Zero-Shot ASR with SAM-Audio and Whisper

Add code
Mar 05, 2026
Viaarxiv icon

The Patrologia Graeca Corpus: OCR, Annotation, and Open Release of Noisy Nineteenth-Century Polytonic Greek Editions

Add code
Mar 10, 2026
Viaarxiv icon

PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration

Add code
Mar 05, 2026
Viaarxiv icon

WhisperAlign: Word-Boundary-Aware ASR and WhisperX-Anchored Pyannote Diarization for Long-Form Bengali Speech

Add code
Mar 05, 2026
Viaarxiv icon