Speech Recognition

Speech recognition is the task of identifying words spoken aloud by analyzing the voice and language in an audio signal, and transcribing those words accurately as text.

Speech Encoder Fusion for LLM-based Automatic Speech Recognition

Jun 09, 2026
Via arXiv

Phoneme-First Prediction for LLM-Based Speech Recognition

Jun 09, 2026
Via arXiv

Speaker Group Encoding in Self-supervised Speech Recognition Models

Jun 09, 2026
Via arXiv

Speech Meets ELF: Audio Conditional Continuous-Target Diffusion for Speech Recognition and Translation

Jun 09, 2026
Via arXiv

Towards Deep Contextual Reasoning from Broad Descriptions for ASR with Speech-LLM via Metadata-Driven Reasoning Chains

Jun 09, 2026
Via arXiv

Towards Robust Arabic Speech Emotion Recognition with Deep Learning

Jun 09, 2026
Via arXiv

Enhancing Multilingual LLM-based ASR with Mixture of Experts and Dynamic Downsampling

Jun 09, 2026
Via arXiv

GC-LoRA: Gated Convolutional LoRA for Parameter-Efficient Acoustic Adaptation

Jun 09, 2026
Via arXiv

Entropy-Aware Domain-Routed Mixture-of-Experts Speech-LLM Framework: A Case Study of Multi-Domain Child-Adult ASR

Jun 09, 2026
Via arXiv

Rethinking Depth: A study of the Recursive-Transformer for Speech Recognition

Jun 08, 2026
Via arXiv