Speech Recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately as text.

Automatic Speech Recognition for Documenting Endangered Languages: Case Study of Ikema Miyakoan

Mar 27, 2026

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Mar 27, 2026

AdaLTM: Adaptive Layer-wise Task Vector Merging for Categorical Speech Emotion Recognition with ASR Knowledge Integration

Mar 26, 2026

JAL-Turn: Joint Acoustic-Linguistic Modeling for Real-Time and Robust Turn-Taking Detection in Full-Duplex Spoken Dialogue Systems

Mar 27, 2026

Evaluating Interactive 2D Visualization as a Sample Selection Strategy for Biomedical Time-Series Data Annotation

Mar 27, 2026

How Class Ontology and Data Scale Affect Audio Transfer Learning

Mar 26, 2026

Goodness-of-pronunciation without phoneme time alignment

Mar 26, 2026

Back to Basics: Revisiting ASR in the Age of Voice Agents

Mar 26, 2026

A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English

Mar 25, 2026

From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs

Mar 25, 2026