Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately as text.

Automatic Speech Recognition for Documenting Endangered Languages: Case Study of Ikema Miyakoan

Mar 27, 2026
Via arXiv

EBuddy: a workflow orchestrator for industrial human-machine collaboration

Mar 30, 2026
Via arXiv

Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models

Mar 30, 2026
Via arXiv

On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR

Mar 30, 2026
Via arXiv

AdaLTM: Adaptive Layer-wise Task Vector Merging for Categorical Speech Emotion Recognition with ASR Knowledge Integration

Mar 26, 2026
Via arXiv

A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English

Mar 25, 2026
Via arXiv

Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages

Mar 24, 2026
Via arXiv

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Mar 27, 2026
Via arXiv

How Class Ontology and Data Scale Affect Audio Transfer Learning

Mar 26, 2026
Via arXiv

Goodness-of-pronunciation without phoneme time alignment

Mar 26, 2026
Via arXiv