Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, and transcribing those words accurately as text.

DeRAGEC: Denoising Named Entity Candidates with Synthetic Rationale for ASR Error Correction

Jun 09, 2025
Via arXiv

Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation

Jun 09, 2025
Via arXiv

Enabling automatic transcription of child-centered audio recordings from real-world environments

Jun 13, 2025
Via arXiv

Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments

Jun 13, 2025
Via arXiv

In-context Language Learning for Endangered Languages in Speech Recognition

May 28, 2025
Via arXiv

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Jun 11, 2025
Via arXiv

Uncovering the Functional Roles of Nonlinearity in Memory

Jun 09, 2025
Via arXiv

SuPseudo: A Pseudo-supervised Learning Method for Neural Speech Enhancement in Far-field Speech Recognition

May 30, 2025
Via arXiv

Running Conventional Automatic Speech Recognition on Memristor Hardware: A Simulated Approach

May 30, 2025
Via arXiv

Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Jun 07, 2025
Via arXiv