speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Text-only adaptation in LLM-based ASR through text denoising

Add code
Jan 28, 2026
Viaarxiv icon

A Study of Data Selection Strategies for Pre-training Self-Supervised Speech Models

Add code
Jan 28, 2026
Viaarxiv icon

Qwen3-ASR Technical Report

Add code
Jan 29, 2026
Viaarxiv icon

Mind the Shift: Using Delta SSL Embeddings to Enhance Child ASR

Add code
Jan 28, 2026
Viaarxiv icon

SLM-SS: Speech Language Model for Generative Speech Separation

Add code
Jan 27, 2026
Viaarxiv icon

SpatialEmb: Extract and Encode Spatial Information for 1-Stage Multi-channel Multi-speaker ASR on Arbitrary Microphone Arrays

Add code
Jan 25, 2026
Viaarxiv icon

Language Family Matters: Evaluating LLM-Based ASR Across Linguistic Boundaries

Add code
Jan 26, 2026
Viaarxiv icon

Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks

Add code
Jan 27, 2026
Viaarxiv icon

Window Size Versus Accuracy Experiments in Voice Activity Detectors

Add code
Jan 24, 2026
Viaarxiv icon

Multilingual Dysarthric Speech Assessment Using Universal Phone Recognition and Language-Specific Phonemic Contrast Modeling

Add code
Jan 29, 2026
Viaarxiv icon