speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Joint ASR and Speaker Role Tagging with Serialized Output Training

Add code
Jun 12, 2025
Viaarxiv icon

Unified Semi-Supervised Pipeline for Automatic Speech Recognition

Add code
Jun 09, 2025
Viaarxiv icon

Improving Named Entity Transcription with Contextual LLM-based Revision

Add code
Jun 12, 2025
Viaarxiv icon

(SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test

Add code
Jun 13, 2025
Viaarxiv icon

Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition

Add code
Jun 09, 2025
Viaarxiv icon

Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia

Add code
Jun 10, 2025
Viaarxiv icon

Enabling automatic transcription of child-centered audio recordings from real-world environments

Add code
Jun 13, 2025
Viaarxiv icon

Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments

Add code
Jun 13, 2025
Viaarxiv icon

Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms

Add code
Jun 12, 2025
Viaarxiv icon

Speech Recognition on TV Series with Video-guided Post-Correction

Add code
Jun 08, 2025
Viaarxiv icon