speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

A Study of Data Selection Strategies for Pre-training Self-Supervised Speech Models

Add code
Jan 28, 2026
Viaarxiv icon

Mind the Shift: Using Delta SSL Embeddings to Enhance Child ASR

Add code
Jan 28, 2026
Viaarxiv icon

Do we really need Self-Attention for Streaming Automatic Speech Recognition?

Add code
Jan 27, 2026
Viaarxiv icon

Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition

Add code
Jan 27, 2026
Viaarxiv icon

Pisets: A Robust Speech Recognition System for Lectures and Interviews

Add code
Jan 26, 2026
Viaarxiv icon

SLM-SS: Speech Language Model for Generative Speech Separation

Add code
Jan 27, 2026
Viaarxiv icon

Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks

Add code
Jan 27, 2026
Viaarxiv icon

SE-DiCoW: Self-Enrolled Diarization-Conditioned Whisper

Add code
Jan 27, 2026
Viaarxiv icon

An Effective Energy Mask-based Adversarial Evasion Attacks against Misclassification in Speaker Recognition Systems

Add code
Jan 29, 2026
Viaarxiv icon

MA-LipNet: Multi-Dimensional Attention Networks for Robust Lipreading

Add code
Jan 27, 2026
Viaarxiv icon