speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models

Add code
Mar 30, 2026
Viaarxiv icon

LLM Probe: Evaluating LLMs for Low-Resource Languages

Add code
Mar 31, 2026
Viaarxiv icon

On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR

Add code
Mar 30, 2026
Viaarxiv icon

Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection

Add code
Apr 10, 2026
Viaarxiv icon

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Add code
Mar 27, 2026
Viaarxiv icon

Crab: Multi Layer Contrastive Supervision to Improve Speech Emotion Recognition Under Both Acted and Natural Speech Condition

Add code
Mar 24, 2026
Viaarxiv icon

How Class Ontology and Data Scale Affect Audio Transfer Learning

Add code
Mar 26, 2026
Viaarxiv icon

Goodness-of-pronunciation without phoneme time alignment

Add code
Mar 26, 2026
Viaarxiv icon

From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs

Add code
Mar 25, 2026
Viaarxiv icon

Back to Basics: Revisiting ASR in the Age of Voice Agents

Add code
Mar 26, 2026
Viaarxiv icon