speech


Uncovering the Functional Roles of Nonlinearity in Memory

Add code
Jun 09, 2025
Viaarxiv icon

Swiss Parliaments Corpus Re-Imagined (SPC_R): Enhanced Transcription with RAG-based Correction and Predicted BLEU

Add code
Jun 09, 2025
Viaarxiv icon

Silencing Empowerment, Allowing Bigotry: Auditing the Moderation of Hate Speech on Twitch

Add code
Jun 09, 2025
Viaarxiv icon

DEBATE: A Dataset for Disentangling Textual Ambiguity in Mandarin Through Speech

Add code
Jun 09, 2025
Viaarxiv icon

Audio-Sync Video Generation with Multi-Stream Temporal Control

Add code
Jun 09, 2025
Viaarxiv icon

Unified Semi-Supervised Pipeline for Automatic Speech Recognition

Add code
Jun 09, 2025
Viaarxiv icon

Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition

Add code
Jun 09, 2025
Viaarxiv icon

Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation

Add code
Jun 09, 2025
Viaarxiv icon

DeRAGEC: Denoising Named Entity Candidates with Synthetic Rationale for ASR Error Correction

Add code
Jun 09, 2025
Viaarxiv icon

Towards Generalized Source Tracing for Codec-Based Deepfake Speech

Add code
Jun 08, 2025
Viaarxiv icon