Picture for Guoqiao Yu

Guoqiao Yu

LongCat-AudioDiT: High-Fidelity Diffusion Text-to-Speech in the Waveform Latent Space

Add code
Mar 31, 2026
Viaarxiv icon

AS-Speech: Adaptive Style For Speech Synthesis

Add code
Sep 09, 2024
Figure 1 for AS-Speech: Adaptive Style For Speech Synthesis
Figure 2 for AS-Speech: Adaptive Style For Speech Synthesis
Figure 3 for AS-Speech: Adaptive Style For Speech Synthesis
Figure 4 for AS-Speech: Adaptive Style For Speech Synthesis
Viaarxiv icon

Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios

Add code
Dec 23, 2021
Figure 1 for Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios
Figure 2 for Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios
Figure 3 for Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios
Figure 4 for Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios
Viaarxiv icon