
Zhengyan Sheng

Introducing voice timbre attribute detection

May 14, 2025, via arXiv

The Voice Timbre Attribute Detection 2025 Challenge Evaluation Plan

May 14, 2025, via arXiv

UniSpeaker: A Unified Approach for Multimodality-driven Speaker Generation

Jan 11, 2025, via arXiv

CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models

Dec 13, 2024, via arXiv

Voice Attribute Editing with Text Prompt

Apr 13, 2024, via arXiv