Speech Recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the voice and language, and transcribing them accurately as text.

Speech Meets ELF: Audio Conditional Continuous-Target Diffusion for Speech Recognition and Translation

Jun 09, 2026
Via arXiv

Towards Deep Contextual Reasoning from Broad Descriptions for ASR with Speech-LLM via Metadata-Driven Reasoning Chains

Jun 09, 2026
Via arXiv

Towards Robust Arabic Speech Emotion Recognition with Deep Learning

Jun 09, 2026
Via arXiv

Rethinking Depth: A study of the Recursive-Transformer for Speech Recognition

Jun 08, 2026
Via arXiv

Parameter-Efficient Continual Learning for Automatic Speech Recognition

Jun 08, 2026
Via arXiv

Enhancing Multilingual LLM-based ASR with Mixture of Experts and Dynamic Downsampling

Jun 09, 2026
Via arXiv

GC-LoRA: Gated Convolutional LoRA for Parameter-Efficient Acoustic Adaptation

Jun 09, 2026
Via arXiv

Entropy-Aware Domain-Routed Mixture-of-Experts Speech-LLM Framework: A Case Study of Multi-Domain Child-Adult ASR

Jun 09, 2026
Via arXiv

A study on the impact of region specific data on the performance of Indic ASR

Jun 08, 2026
Via arXiv

Titans-as-a-Layer: Test-Time Memory for Conversational Speech Emotion Recognition

Jun 07, 2026
Via arXiv