speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

What do Speech Foundation Models Learn? Analysis and Applications

Add code
Aug 17, 2025
Viaarxiv icon

Fairness of Automatic Speech Recognition: Looking Through a Philosophical Lens

Add code
Aug 13, 2025
Figure 1 for Fairness of Automatic Speech Recognition: Looking Through a Philosophical Lens
Viaarxiv icon

Bridging ASR and LLMs for Dysarthric Speech Recognition: Benchmarking Self-Supervised and Generative Approaches

Add code
Aug 11, 2025
Viaarxiv icon

Depression diagnosis from patient interviews using multimodal machine learning

Add code
Aug 26, 2025
Viaarxiv icon

AD-AVSR: Asymmetric Dual-stream Enhancement for Robust Audio-Visual Speech Recognition

Add code
Aug 11, 2025
Viaarxiv icon

DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition

Add code
Aug 12, 2025
Viaarxiv icon

CarelessWhisper: Turning Whisper into a Causal Streaming Model

Add code
Aug 17, 2025
Viaarxiv icon

Landmark Guided Visual Feature Extractor for Visual Speech Recognition with Limited Resource

Add code
Aug 10, 2025
Viaarxiv icon

Pitch Accent Detection improves Pretrained Automatic Speech Recognition

Add code
Aug 06, 2025
Viaarxiv icon

Revealing the Role of Audio Channels in ASR Performance Degradation

Add code
Aug 12, 2025
Viaarxiv icon