Xu Tan

GAIA: Zero-shot Talking Avatar Generation

Nov 26, 2023

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Oct 25, 2023

UniAudio: An Audio Foundation Model Toward Universal Audio Generation

Oct 11, 2023

MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation

Sep 20, 2023

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Sep 15, 2023

PromptTTS 2: Describing and Generating Voices with Text Prompt

Sep 05, 2023

VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer

Aug 11, 2023

ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading

Jul 03, 2023

EmoGen: Eliminating Subjective Bias in Emotional Music Generation

Jul 03, 2023

Extract and Attend: Improving Entity Translation in Neural Machine Translation

Jun 04, 2023