speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

VisG AV-HuBERT: Viseme-Guided AV-HuBERT

Add code
Apr 01, 2026
Viaarxiv icon

FLEURS-Kobani: Extending the FLEURS Dataset for Northern Kurdish

Add code
Mar 31, 2026
Viaarxiv icon

LLM Probe: Evaluating LLMs for Low-Resource Languages

Add code
Mar 31, 2026
Viaarxiv icon

EBuddy: a workflow orchestrator for industrial human-machine collaboration

Add code
Mar 30, 2026
Viaarxiv icon

Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models

Add code
Mar 30, 2026
Viaarxiv icon

On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR

Add code
Mar 30, 2026
Viaarxiv icon

Automatic Speech Recognition for Documenting Endangered Languages: Case Study of Ikema Miyakoan

Add code
Mar 27, 2026
Viaarxiv icon

AdaLTM: Adaptive Layer-wise Task Vector Merging for Categorical Speech Emotion Recognition with ASR Knowledge Integration

Add code
Mar 26, 2026
Viaarxiv icon

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Add code
Mar 27, 2026
Viaarxiv icon

An Empirical Recipe for Universal Phone Recognition

Add code
Mar 30, 2026
Viaarxiv icon