Chengzuo Yang

LongCat-AudioDiT: High-Fidelity Diffusion Text-to-Speech in the Waveform Latent Space

Mar 31, 2026
Via arXiv