Picture for Dake Guo

Dake Guo

FlashTTS: Fast Streaming TTS with MTP Acceleration and X-pred Mean Flow Distillation

Add code
Jun 09, 2026
Viaarxiv icon

MeanVC 2: Robust Low-Latency Streaming Zero-Shot Voice Conversion

Add code
Jun 08, 2026
Viaarxiv icon

MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech

Add code
Apr 20, 2026
Viaarxiv icon

OmniCodec: Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement

Add code
Mar 21, 2026
Viaarxiv icon

Qwen3-TTS Technical Report

Add code
Jan 22, 2026
Viaarxiv icon

VoiceSculptor: Your Voice, Designed By You

Add code
Jan 15, 2026
Viaarxiv icon

FlexSpeech: Towards Stable, Controllable and Expressive Text-to-Speech

Add code
May 08, 2025
Viaarxiv icon

The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge

Add code
Oct 31, 2024
Viaarxiv icon

The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings

Add code
Oct 31, 2024
Figure 1 for The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings
Figure 2 for The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings
Figure 3 for The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings
Figure 4 for The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings
Viaarxiv icon

NPU-NTU System for Voice Privacy 2024 Challenge

Add code
Sep 06, 2024
Figure 1 for NPU-NTU System for Voice Privacy 2024 Challenge
Figure 2 for NPU-NTU System for Voice Privacy 2024 Challenge
Viaarxiv icon