Speech Recognition


Speech recognition is the task of converting spoken audio into text: identifying the words spoken aloud, analyzing the voice and language, and transcribing the words accurately.

Unrequited Emotions: Investigating the Gaps in Motivation and Practice in Speech Emotion Recognition Research

Apr 28, 2026

UNet-Based Fusion and Exponential Moving Average Adaptation for Noise-Robust Speaker Recognition

Apr 28, 2026

2nd of the 5th PVUW MeViS-Audio Track: ASR-SaSaSa2VA

Apr 27, 2026

Identifying and typifying demographic unfairness in phoneme-level embeddings of self-supervised speech recognition models

Apr 24, 2026

Advancing automatic speech recognition using feature fusion with self-supervised learning features: A case study on Fearless Steps Apollo corpus

Apr 24, 2026

Evaluation of Automatic Speech Recognition Using Generative Large Language Models

Apr 23, 2026

Au-M-ol: A Unified Model for Medical Audio and Language Understanding

Apr 25, 2026

Do LLM Decoders Listen Fairly? Benchmarking How Language Model Priors Shape Bias in Speech Recognition

Apr 23, 2026

DM-ASR: Diarization-aware Multi-speaker ASR with Large Language Models

Apr 24, 2026

Voice of India: A Large-Scale Benchmark for Real-World Speech Recognition in India

Apr 21, 2026