Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, then transcribing those words accurately as text.

DeRAGEC: Denoising Named Entity Candidates with Synthetic Rationale for ASR Error Correction

Jun 09, 2025

Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation

Jun 09, 2025

Enabling automatic transcription of child-centered audio recordings from real-world environments

Jun 13, 2025

Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments

Jun 13, 2025

In-context Language Learning for Endangered Languages in Speech Recognition

May 28, 2025

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Jun 11, 2025

SuPseudo: A Pseudo-supervised Learning Method for Neural Speech Enhancement in Far-field Speech Recognition

May 30, 2025

Uncovering the Functional Roles of Nonlinearity in Memory

Jun 09, 2025

Running Conventional Automatic Speech Recognition on Memristor Hardware: A Simulated Approach

May 30, 2025

Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Jun 07, 2025