Picture for Qixi Zheng

Qixi Zheng

X-VC: Zero-shot Streaming Voice Conversion in Codec Space

Add code
Apr 14, 2026
Viaarxiv icon

Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling

Add code
May 26, 2025
Figure 1 for Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling
Figure 2 for Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling
Figure 3 for Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling
Figure 4 for Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling
Viaarxiv icon

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Add code
Mar 03, 2025
Figure 1 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 2 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 3 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 4 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Viaarxiv icon