Xu Tan

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

May 04, 2023
Kai Shen, Zeqian Ju, Xu Tan, Yanqing Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian

ResiDual: Transformer with Dual Residual Connections

Apr 28, 2023
Shufang Xie, Huishuai Zhang, Junliang Guo, Xu Tan, Jiang Bian, Hany Hassan Awadalla, Arul Menezes, Tao Qin, Rui Yan

CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval

Apr 24, 2023
Shangda Wu, Dingyao Yu, Xu Tan, Maosong Sun

DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder

Apr 23, 2023
Chenpeng Du, Qi Chen, Tianyu He, Xu Tan, Xie Chen, Kai Yu, Sheng Zhao, Jiang Bian

AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models

Apr 05, 2023
Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao

Improving Autoencoder-based Outlier Detection with Adjustable Probabilistic Reconstruction Error and Mean-shift Outlier Scoring

Apr 03, 2023
Xu Tan, Jiawei Yang, Junqi Chen, Sylwan Rahardja, Susanto Rahardja

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace

Apr 02, 2023
Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
