speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Joint decoding method for controllable contextual speech recognition based on Speech LLM

Add code
Aug 12, 2025
Viaarxiv icon

Fairness of Automatic Speech Recognition: Looking Through a Philosophical Lens

Add code
Aug 13, 2025
Viaarxiv icon

Bridging ASR and LLMs for Dysarthric Speech Recognition: Benchmarking Self-Supervised and Generative Approaches

Add code
Aug 11, 2025
Viaarxiv icon

DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition

Add code
Aug 12, 2025
Viaarxiv icon

AD-AVSR: Asymmetric Dual-stream Enhancement for Robust Audio-Visual Speech Recognition

Add code
Aug 11, 2025
Viaarxiv icon

FlexCTC: GPU-powered CTC Beam Decoding With Advanced Contextual Abilities

Add code
Aug 13, 2025
Viaarxiv icon

Revealing the Role of Audio Channels in ASR Performance Degradation

Add code
Aug 12, 2025
Viaarxiv icon

Landmark Guided Visual Feature Extractor for Visual Speech Recognition with Limited Resource

Add code
Aug 10, 2025
Viaarxiv icon

Out of the Box, into the Clinic? Evaluating State-of-the-Art ASR for Clinical Applications for Older Adults

Add code
Aug 12, 2025
Viaarxiv icon

TurboBias: Universal ASR Context-Biasing powered by GPU-accelerated Phrase-Boosting Tree

Add code
Aug 12, 2025
Viaarxiv icon