speech


An Exploration of Mamba for Speech Self-Supervised Models

Jun 14, 2025
Via arXiv

StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling

Jun 14, 2025
Via arXiv

Mitigating Non-Target Speaker Bias in Guided Speaker Embedding

Jun 14, 2025
Via arXiv

Phonikud: Hebrew Grapheme-to-Phoneme Conversion for Real-Time Text-to-Speech

Jun 14, 2025
Via arXiv

Towards Fairness Assessment of Dutch Hate Speech Detection

Jun 14, 2025
Via arXiv

Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction

Jun 14, 2025
Via arXiv

From Sharpness to Better Generalization for Speech Deepfake Detection

Jun 13, 2025
Via arXiv

Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding

Jun 13, 2025
Via arXiv

Effectiveness of Counter-Speech against Abusive Content: A Multidimensional Annotation and Classification Study

Jun 13, 2025
Via arXiv

Confidence-Based Self-Training for EMG-to-Speech: Leveraging Synthetic EMG for Robust Modeling

Jun 13, 2025
Via arXiv