speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

A study on the impact of region specific data on the performance of Indic ASR

Add code
Jun 08, 2026
Viaarxiv icon

LLM can Read Spectrogram: Encoder-free Speech-Language Modeling

Add code
Jun 08, 2026
Viaarxiv icon

Is Text All You Need? Text as a Universal Information Bottleneck for Speech LLMs

Add code
Jun 08, 2026
Viaarxiv icon

Contrastive Training with LLM-generated Near-Misses for Robust Code-Switching Speech Recognition

Add code
Jun 05, 2026
Viaarxiv icon

FiLM-Based Speaker Conditioning of a SpeechLLM for Pathological Speech Recognition

Add code
Jun 04, 2026
Viaarxiv icon

Multi-task Learning is Not Enough: Representational Entanglement in Dual-output Second Language Speech Recognition

Add code
Jun 04, 2026
Viaarxiv icon

M2S-AVSR: Modality-aware Multi-view Self-supervised Representation for Robust Audio-Visual Speech Recognition

Add code
Jun 04, 2026
Viaarxiv icon

Acoustic Cue Alignment in Audio Language Models for Speech Emotion Recognition

Add code
Jun 05, 2026
Viaarxiv icon

Beyond Waveform Robustness: Robust Feature-Vocoder Adversarial Attacks on Automatic Speech Recognition

Add code
Jun 04, 2026
Viaarxiv icon

Hearing the Unspoken: Language Model Priors for Acoustic Adversarial Attacks

Add code
Jun 05, 2026
Viaarxiv icon