Picture for Yuanzhe Chen

Yuanzhe Chen

StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion

Add code
Aug 05, 2024
Viaarxiv icon

Improving Audio Generation with Visual Enhanced Caption

Add code
Jul 05, 2024
Viaarxiv icon

Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

Add code
Jun 04, 2024
Figure 1 for Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Figure 2 for Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Figure 3 for Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Figure 4 for Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Viaarxiv icon

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Add code
Apr 27, 2024
Viaarxiv icon

StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion

Add code
Feb 07, 2024
Viaarxiv icon

LM-VC: Zero-shot Voice Conversion via Speech Generation based on Language Models

Add code
Jun 18, 2023
Viaarxiv icon

Multi-level Temporal-channel Speaker Retrieval for Robust Zero-shot Voice Conversion

Add code
May 12, 2023
Viaarxiv icon

Non-parallel Accent Conversion using Pseudo Siamese Disentanglement Network

Add code
Dec 12, 2022
Viaarxiv icon

Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints

Add code
Nov 16, 2022
Viaarxiv icon

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

Add code
Oct 27, 2022
Viaarxiv icon