Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, then transcribing those words accurately as text.

RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery

Mar 17, 2026
Via arXiv

LLMs and Speech: Integration vs. Combination

Mar 16, 2026
Via arXiv

Is Semi-Automatic Transcription Useful in Corpus Creation? Preliminary Considerations on the KIParla Corpus

Mar 17, 2026
Via arXiv

On the Emotion Understanding of Synthesized Speech

Mar 17, 2026
Via arXiv

Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR

Mar 17, 2026
Via arXiv

Tagarela - A Portuguese speech dataset from podcasts

Mar 16, 2026
Via arXiv

Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition

Mar 12, 2026
Via arXiv

Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition

Mar 11, 2026
Via arXiv

Huntington Disease Automatic Speech Recognition with Biomarker Supervision

Mar 11, 2026
Via arXiv

Trade-offs Between Capacity and Robustness in Neural Audio Codecs for Adversarially Robust Speech Recognition

Mar 10, 2026
Via arXiv