Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately as text.

Evaluation of Automatic Speech Recognition Using Generative Large Language Models

Apr 23, 2026
Via arXiv

Do LLM Decoders Listen Fairly? Benchmarking How Language Model Priors Shape Bias in Speech Recognition

Apr 23, 2026
Via arXiv

Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines

Apr 22, 2026
Via arXiv

ATIR: Towards Audio-Text Interleaved Contextual Retrieval

Apr 22, 2026
Via arXiv

"This Wasn't Made for Me": Recentering User Experience and Emotional Impact in the Evaluation of ASR Bias

Apr 22, 2026
Via arXiv

Voice of India: A Large-Scale Benchmark for Real-World Speech Recognition in India

Apr 21, 2026
Via arXiv

Reducing the Offline-Streaming Gap for Unified ASR Transducer with Consistency Regularization

Apr 21, 2026
Via arXiv

Detecting Hallucinations in SpeechLLMs at Inference Time Using Attention Maps

Apr 21, 2026
Via arXiv

UAF: A Unified Audio Front-end LLM for Full-Duplex Speech Interaction

Apr 21, 2026
Via arXiv

Where Do Self-Supervised Speech Models Become Unfair?

Apr 20, 2026
Via arXiv