Speech Recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately as text.

Learning Multiple Utterance-Level Attribute Representations with a Unified Speech Encoder

Mar 09, 2026
Via arXiv

The Patrologia Graeca Corpus: OCR, Annotation, and Open Release of Noisy Nineteenth-Century Polytonic Greek Editions

Mar 10, 2026
Via arXiv

Federated Heterogeneous Language Model Optimization for Hybrid Automatic Speech Recognition

Mar 05, 2026
Via arXiv

Beyond Word Error Rate: Auditing the Diversity Tax in Speech Recognition through Dataset Cartography

Mar 05, 2026
Via arXiv

Robust LLM-based Audio-Visual Speech Recognition with Sparse Modality Alignment and Visual Unit-Guided Refinement

Mar 04, 2026
Via arXiv

Benchmarking Speech Systems for Frontline Health Conversations: The DISPLACE-M Challenge

Mar 05, 2026
Via arXiv

SilentWear: an Ultra-Low Power Wearable System for EMG-based Silent Speech Recognition

Mar 04, 2026
Via arXiv

Visual-Informed Speech Enhancement Using Attention-Based Beamforming

Mar 05, 2026
Via arXiv

When Denoising Hinders: Revisiting Zero-Shot ASR with SAM-Audio and Whisper

Mar 05, 2026
Via arXiv

PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration

Mar 05, 2026
Via arXiv