Alert button
Picture for Xu Tan

Xu Tan

Alert button

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Oct 25, 2023
Dingyao Yu, Kaitao Song, Peiling Lu, Tianyu He, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian

Figure 1 for MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Figure 2 for MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Figure 3 for MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Figure 4 for MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Viaarxiv icon

UniAudio: An Audio Foundation Model Toward Universal Audio Generation

Oct 11, 2023
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng

Figure 1 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 2 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 3 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 4 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Viaarxiv icon

MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation

Sep 20, 2023
Xinda Wu, Zhijie Huang, Kejun Zhang, Jiaxing Yu, Xu Tan, Tieyao Zhang, Zihao Wang, Lingyun Sun

Figure 1 for MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
Figure 2 for MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
Figure 3 for MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
Figure 4 for MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
Viaarxiv icon

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Sep 15, 2023
Qingyan Guo, Rui Wang, Junliang Guo, Bei Li, Kaitao Song, Xu Tan, Guoqing Liu, Jiang Bian, Yujiu Yang

Figure 1 for Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Figure 2 for Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Figure 3 for Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Figure 4 for Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Viaarxiv icon

PromptTTS 2: Describing and Generating Voices with Text Prompt

Sep 05, 2023
Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian

Figure 1 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 2 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 3 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 4 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Viaarxiv icon

VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer

Aug 11, 2023
Liyang Chen, Zhiyong Wu, Runnan Li, Weihong Bao, Jun Ling, Xu Tan, Sheng Zhao

Figure 1 for VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer
Figure 2 for VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer
Figure 3 for VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer
Figure 4 for VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer
Viaarxiv icon

ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading

Jul 03, 2023
Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee

Figure 1 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 2 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 3 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 4 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Viaarxiv icon

EmoGen: Eliminating Subjective Bias in Emotional Music Generation

Jul 03, 2023
Chenfei Kang, Peiling Lu, Botao Yu, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian

Figure 1 for EmoGen: Eliminating Subjective Bias in Emotional Music Generation
Figure 2 for EmoGen: Eliminating Subjective Bias in Emotional Music Generation
Figure 3 for EmoGen: Eliminating Subjective Bias in Emotional Music Generation
Figure 4 for EmoGen: Eliminating Subjective Bias in Emotional Music Generation
Viaarxiv icon