Picture for Zeqian Ju

Zeqian Ju

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Add code
Apr 25, 2024
Viaarxiv icon

RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

Add code
Apr 06, 2024
Viaarxiv icon

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Add code
Mar 05, 2024
Figure 1 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 2 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 3 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 4 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Viaarxiv icon

PromptTTS 2: Describing and Generating Voices with Text Prompt

Add code
Sep 05, 2023
Figure 1 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 2 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 3 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 4 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Viaarxiv icon

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

Add code
May 04, 2023
Figure 1 for NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Figure 2 for NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Figure 3 for NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Figure 4 for NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Viaarxiv icon

AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models

Add code
Apr 05, 2023
Figure 1 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 2 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 3 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 4 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Viaarxiv icon

TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method

Add code
Sep 20, 2021
Figure 1 for TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method
Figure 2 for TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method
Figure 3 for TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method
Figure 4 for TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method
Viaarxiv icon

MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training

Add code
Jun 10, 2021
Figure 1 for MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
Figure 2 for MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
Figure 3 for MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
Figure 4 for MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
Viaarxiv icon

On the Generation of Medical Dialogues for COVID-19

Add code
May 11, 2020
Figure 1 for On the Generation of Medical Dialogues for COVID-19
Figure 2 for On the Generation of Medical Dialogues for COVID-19
Figure 3 for On the Generation of Medical Dialogues for COVID-19
Figure 4 for On the Generation of Medical Dialogues for COVID-19
Viaarxiv icon

MedDialog: A Large-scale Medical Dialogue Dataset

Add code
Apr 07, 2020
Figure 1 for MedDialog: A Large-scale Medical Dialogue Dataset
Figure 2 for MedDialog: A Large-scale Medical Dialogue Dataset
Figure 3 for MedDialog: A Large-scale Medical Dialogue Dataset
Figure 4 for MedDialog: A Large-scale Medical Dialogue Dataset
Viaarxiv icon