speech


Probabilistic Verification of Voice Anti-Spoofing Models

Add code
Mar 12, 2026
Viaarxiv icon

Causal Prosody Mediation for Text-to-Speech:Counterfactual Training of Duration, Pitch, and Energy in FastSpeech2

Add code
Mar 12, 2026
Viaarxiv icon

RAF: Relativistic Adversarial Feedback For Universal Speech Synthesis

Add code
Mar 12, 2026
Viaarxiv icon

AnimeScore: A Preference-Based Dataset and Framework for Evaluating Anime-Like Speech Style

Add code
Mar 12, 2026
Viaarxiv icon

TASTE-Streaming: Towards Streamable Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling

Add code
Mar 12, 2026
Viaarxiv icon

Acoustic-to-Articulatory Inversion of Clean Speech Using an MRI-Trained Model

Add code
Mar 12, 2026
Viaarxiv icon

Learnable Pulse Accumulation for On-Device Speech Recognition: How Much Attention Do You Need?

Add code
Mar 11, 2026
Viaarxiv icon

QV May Be Enough: Toward the Essence of Attention in LLMs

Add code
Mar 11, 2026
Viaarxiv icon

Duration Aware Scheduling for ASR Serving Under Workload Drift

Add code
Mar 11, 2026
Viaarxiv icon

Synthetic Data Domain Adaptation for ASR via LLM-based Text and Phonetic Respelling Augmentation

Add code
Mar 11, 2026
Viaarxiv icon