Speech recognition

Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately as text.

RAS: a Reliability Oriented Metric for Automatic Speech Recognition

Apr 28, 2026

WhisperPipe: A Resource-Efficient Streaming Architecture for Real-Time Automatic Speech Recognition

Apr 28, 2026

Unrequited Emotions: Investigating the Gaps in Motivation and Practice in Speech Emotion Recognition Research

Apr 28, 2026

UNet-Based Fusion and Exponential Moving Average Adaptation for Noise-Robust Speaker Recognition

Apr 28, 2026

2nd of the 5th PVUW MeViS-Audio Track: ASR-SaSaSa2VA

Apr 27, 2026

Au-M-ol: A Unified Model for Medical Audio and Language Understanding

Apr 25, 2026

Identifying and typifying demographic unfairness in phoneme-level embeddings of self-supervised speech recognition models

Apr 24, 2026

Advancing automatic speech recognition using feature fusion with self-supervised learning features: A case study on Fearless Steps Apollo corpus

Apr 24, 2026

Evaluation of Automatic Speech Recognition Using Generative Large Language Models

Apr 23, 2026

DM-ASR: Diarization-aware Multi-speaker ASR with Large Language Models

Apr 24, 2026