Speech Recognition


Speech recognition is the task of identifying the words in spoken audio by analyzing the voice and language, and transcribing them accurately as text.

Qwen3-ASR Technical Report

Jan 29, 2026
Via arXiv

SLM-SS: Speech Language Model for Generative Speech Separation

Jan 27, 2026
Via arXiv

Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks

Jan 27, 2026
Via arXiv

SE-DiCoW: Self-Enrolled Diarization-Conditioned Whisper

Jan 27, 2026
Via arXiv

Multilingual Dysarthric Speech Assessment Using Universal Phone Recognition and Language-Specific Phonemic Contrast Modeling

Jan 29, 2026
Via arXiv

MA-LipNet: Multi-Dimensional Attention Networks for Robust Lipreading

Jan 27, 2026
Via arXiv

Factored Reasoning with Inner Speech and Persistent Memory for Evidence-Grounded Human-Robot Interaction

Jan 31, 2026
Via arXiv

DementiaBank-Emotion: A Multi-Rater Emotion Annotation Corpus for Alzheimer's Disease Speech (Version 1.0)

Feb 04, 2026
Via arXiv

A Baseline Multimodal Approach to Emotion Recognition in Conversations

Jan 31, 2026
Via arXiv

An Effective Energy Mask-based Adversarial Evasion Attacks against Misclassification in Speaker Recognition Systems

Jan 29, 2026
Via arXiv