Speech Recognition


Speech recognition is the task of identifying spoken words in audio by analyzing the speaker's voice and language, and transcribing them accurately as text.

Streaming Speech-to-Text Translation with a SpeechLLM

May 14, 2026, via arXiv

A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR

May 14, 2026, via arXiv

REALM: Retrospective Encoder Alignment for LFP Modeling

May 14, 2026, via arXiv

Vividh-ASR: A Complexity-Tiered Benchmark and Optimization Dynamics for Robust Indic Speech Recognition

May 13, 2026, via arXiv

Too Good to Be True: A Study on Modern Automatic Speech Recognition for the Evaluation of Speech Enhancement

May 12, 2026, via arXiv

Mind the Pause: Disfluency-Aware Objective Tuning for Multilingual Speech Correction with LLMs

May 12, 2026, via arXiv

Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model

May 12, 2026, via arXiv

Responsible Benchmarking of Fairness for Automatic Speech Recognition

May 11, 2026, via arXiv

ORICF -- Open Robotics Inference and Control Framework

May 10, 2026, via arXiv

WorldSpeech: A Multilingual Speech Corpus from Around the World

May 09, 2026, via arXiv