speech


Improving Code-Switching ASR with Code-Mixing Guided Synthetic Speech

Add code
Jun 14, 2026
Viaarxiv icon

ROMPAR: Morphological Completion and Demographic Unlearning for Romanian-Accented Speech Recognition

Add code
Jun 14, 2026
Viaarxiv icon

DuraMark: Duration-Embedded Watermarking in LLM-based TTS

Add code
Jun 13, 2026
Viaarxiv icon

Dynamic Prosody Prediction in LLM-based TTS for Improving Speaker Similarity

Add code
Jun 13, 2026
Viaarxiv icon

Phonetically Explainable Speech Deepfake Detection

Add code
Jun 13, 2026
Viaarxiv icon

Prior over Evidence: Stereotype-Driven Diagnosis in LLM-Based L2 Pronunciation Feedback

Add code
Jun 13, 2026
Viaarxiv icon

Stochastic Thermodynamics and SDE-based Generative Models

Add code
Jun 13, 2026
Viaarxiv icon

VoxWatermark: A Large-Scale Benchmark for Audio Watermark Detection under Perturbations

Add code
Jun 13, 2026
Viaarxiv icon

A Practical Evaluation Method for Long-Form Simultaneous Speech-to-Speech Translation

Add code
Jun 13, 2026
Viaarxiv icon

Evaluating and Preserving Lexical Stress in English-to-Chinese Speech-to-Speech Translation

Add code
Jun 13, 2026
Viaarxiv icon