Yuancheng Wang

MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer

Sep 01, 2024
Via arXiv

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

Jul 07, 2024
Via arXiv

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds

Jul 01, 2024
Via arXiv

SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

Jun 19, 2024
Via arXiv

RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

Apr 06, 2024
Via arXiv

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Mar 05, 2024
Via arXiv

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Dec 15, 2023
Via arXiv

Trustworthy Multi-phase Liver Tumor Segmentation via Evidence-based Uncertainty

May 09, 2023
Via arXiv

AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models

Apr 05, 2023
Via arXiv

An Attention-Based Approach for Single Image Super Resolution

Jul 18, 2018
Via arXiv