speech


VoxKnesset: A Large-Scale Longitudinal Hebrew Speech Dataset for Aging Speaker Modeling

Mar 05, 2026
Via arXiv

Stan: An LLM-based thermodynamics course assistant

Mar 04, 2026
Via arXiv

FlowW2N: Whispered-to-Normal Speech Conversion via Flow-Matching

Mar 04, 2026
Via arXiv

ZeSTA: Zero-Shot TTS Augmentation with Domain-Conditioned Training for Data-Efficient Personalized Speech Synthesis

Mar 04, 2026
Via arXiv

VietNormalizer: An Open-Source, Dependency-Free Python Library for Vietnamese Text Normalization in TTS and NLP Applications

Mar 04, 2026
Via arXiv

Upholding Epistemic Agency: A Brouwerian Assertibility Constraint for Responsible AI

Mar 04, 2026
Via arXiv

Cyclostationarity Analysis as a Complement to Self-Supervised Representations for Speech Deepfake Detection

Mar 04, 2026
Via arXiv

Robust LLM-based Audio-Visual Speech Recognition with Sparse Modality Alignment and Visual Unit-Guided Refinement

Mar 04, 2026
Via arXiv

Linguistically Informed Graph Model and Semantic Contrastive Learning for Korean Short Text Classification

Mar 04, 2026
Via arXiv

SilentWear: an Ultra-Low Power Wearable System for EMG-based Silent Speech Recognition

Mar 04, 2026
Via arXiv