Picture for Jialong Mai

Jialong Mai

Parallel GPT: Harmonizing the Independence and Interdependence of Acoustic and Semantic Information for Zero-Shot Text-to-Speech

Add code
Aug 06, 2025
Viaarxiv icon

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Add code
May 12, 2025
Viaarxiv icon