speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Evaluating Interactive 2D Visualization as a Sample Selection Strategy for Biomedical Time-Series Data Annotation

Add code
Mar 27, 2026
Viaarxiv icon

Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition

Add code
Mar 11, 2026
Viaarxiv icon

Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition

Add code
Mar 12, 2026
Viaarxiv icon

Contextual Biasing for ASR in Speech LLM with Common Word Cues and Bias Word Position Prediction

Add code
Apr 14, 2026
Viaarxiv icon

RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery

Add code
Mar 17, 2026
Viaarxiv icon

LLMs and Speech: Integration vs. Combination

Add code
Mar 16, 2026
Viaarxiv icon

An Empirical Recipe for Universal Phone Recognition

Add code
Mar 30, 2026
Viaarxiv icon

Is Semi-Automatic Transcription Useful in Corpus Creation? Preliminary Considerations on the KIParla Corpus

Add code
Mar 17, 2026
Viaarxiv icon

Tagarela - A Portuguese speech dataset from podcasts

Add code
Mar 16, 2026
Viaarxiv icon

On the Emotion Understanding of Synthesized Speech

Add code
Mar 17, 2026
Viaarxiv icon