speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

ROMPAR: Morphological Completion and Demographic Unlearning for Romanian-Accented Speech Recognition

Add code
Jun 14, 2026
Viaarxiv icon

MambAdapter: Lightweight Mamba-Based Adapters for Parameter-Efficient Transfer Learning in Speech and Audio

Add code
Jun 14, 2026
Viaarxiv icon

S-JEPA : Soft Clustering Anchors for Self-Supervised Speech Representation Learning

Add code
Jun 17, 2026
Viaarxiv icon

Improving Code-Switching ASR with Code-Mixing Guided Synthetic Speech

Add code
Jun 14, 2026
Viaarxiv icon

Stringalign: Moving beyond summary statistics with a transparent Unicode-aware tool for evaluating automatic transcription models

Add code
Jun 14, 2026
Viaarxiv icon

MoDiCoL: A Modular Diagnostic Continual Learning Dataset for Robust Speech Recognition

Add code
Jun 12, 2026
Viaarxiv icon

A Practical Evaluation Method for Long-Form Simultaneous Speech-to-Speech Translation

Add code
Jun 13, 2026
Viaarxiv icon

Towards Personalized Federated Learning for Dysarthric Speech Recognition

Add code
Jun 11, 2026
Viaarxiv icon

Balancing ASR and diarization in end-to-end LLMs for multi-talker speech recognition

Add code
Jun 11, 2026
Viaarxiv icon

Learning to Hear Hesitation: Continual Learning for Disfluency-Aware ASR

Add code
Jun 12, 2026
Viaarxiv icon