speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Benchmarking von ASR-Modellen im deutschen medizinischen Kontext: Eine Leistungsanalyse anhand von Anamnesegesprächen

Add code
Jan 23, 2026
Viaarxiv icon

From Human Speech to Ocean Signals: Transferring Speech Large Models for Underwater Acoustic Target Recognition

Add code
Jan 26, 2026
Viaarxiv icon

Scaling Ambiguity: Augmenting Human Annotation in Speech Emotion Recognition with Audio-Language Models

Add code
Jan 21, 2026
Viaarxiv icon

DementiaBank-Emotion: A Multi-Rater Emotion Annotation Corpus for Alzheimer's Disease Speech (Version 1.0)

Add code
Feb 04, 2026
Viaarxiv icon

Inverse-Hessian Regularization for Continual Learning in ASR

Add code
Jan 21, 2026
Viaarxiv icon

Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface

Add code
Jan 21, 2026
Viaarxiv icon

Sink or SWIM: Tackling Real-Time ASR at Scale

Add code
Jan 22, 2026
Viaarxiv icon

SoundBreak: A Systematic Study of Audio-Only Adversarial Attacks on Trimodal Models

Add code
Jan 20, 2026
Viaarxiv icon

Lost in Transcription: How Speech-to-Text Errors Derail Code Understanding

Add code
Jan 20, 2026
Viaarxiv icon

CTC-DID: CTC-Based Arabic dialect identification for streaming applications

Add code
Jan 18, 2026
Viaarxiv icon