speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

UNet-Based Fusion and Exponential Moving Average Adaptation for Noise-Robust Speaker Recognition

Add code
Apr 28, 2026
Viaarxiv icon

Do LLM Decoders Listen Fairly? Benchmarking How Language Model Priors Shape Bias in Speech Recognition

Add code
Apr 23, 2026
Viaarxiv icon

Au-M-ol: A Unified Model for Medical Audio and Language Understanding

Add code
Apr 25, 2026
Viaarxiv icon

Voice of India: A Large-Scale Benchmark for Real-World Speech Recognition in India

Add code
Apr 21, 2026
Viaarxiv icon

DM-ASR: Diarization-aware Multi-speaker ASR with Large Language Models

Add code
Apr 24, 2026
Viaarxiv icon

Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines

Add code
Apr 22, 2026
Viaarxiv icon

ATIR: Towards Audio-Text Interleaved Contextual Retrieval

Add code
Apr 22, 2026
Viaarxiv icon

Reducing the Offline-Streaming Gap for Unified ASR Transducer with Consistency Regularization

Add code
Apr 21, 2026
Viaarxiv icon

Detecting Hallucinations in SpeechLLMs at Inference Time Using Attention Maps

Add code
Apr 21, 2026
Viaarxiv icon

"This Wasn't Made for Me": Recentering User Experience and Emotional Impact in the Evaluation of ASR Bias

Add code
Apr 22, 2026
Viaarxiv icon