Alert button
Picture for Xu Tan

Xu Tan

Alert button

WuYun: Exploring hierarchical skeleton-guided melody generation using knowledge-enhanced deep learning

Add code
Bookmark button
Alert button
Jan 11, 2023
Kejun Zhang, Xinda Wu, Tieyao Zhang, Zhijie Huang, Xu Tan, Qihao Liang, Songruoyao Wu, Lingyun Sun

Figure 1 for WuYun: Exploring hierarchical skeleton-guided melody generation using knowledge-enhanced deep learning
Figure 2 for WuYun: Exploring hierarchical skeleton-guided melody generation using knowledge-enhanced deep learning
Figure 3 for WuYun: Exploring hierarchical skeleton-guided melody generation using knowledge-enhanced deep learning
Figure 4 for WuYun: Exploring hierarchical skeleton-guided melody generation using knowledge-enhanced deep learning
Viaarxiv icon

ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech

Add code
Bookmark button
Alert button
Dec 30, 2022
Zehua Chen, Yihan Wu, Yichong Leng, Jiawei Chen, Haohe Liu, Xu Tan, Yang Cui, Ke Wang, Lei He, Sheng Zhao, Jiang Bian, Danilo Mandic

Figure 1 for ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Figure 2 for ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Figure 3 for ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Figure 4 for ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Viaarxiv icon

Difformer: Empowering Diffusion Model on Embedding Space for Text Generation

Add code
Bookmark button
Alert button
Dec 19, 2022
Zhujin Gao, Junliang Guo, Xu Tan, Yongxin Zhu, Fang Zhang, Jiang Bian, Linli Xu

Figure 1 for Difformer: Empowering Diffusion Model on Embedding Space for Text Generation
Figure 2 for Difformer: Empowering Diffusion Model on Embedding Space for Text Generation
Figure 3 for Difformer: Empowering Diffusion Model on Embedding Space for Text Generation
Figure 4 for Difformer: Empowering Diffusion Model on Embedding Space for Text Generation
Viaarxiv icon

Memories are One-to-Many Mapping Alleviators in Talking Face Generation

Add code
Bookmark button
Alert button
Dec 12, 2022
Anni Tang, Tianyu He, Xu Tan, Jun Ling, Runnan Li, Sheng Zhao, Li Song, Jiang Bian

Figure 1 for Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Figure 2 for Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Figure 3 for Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Figure 4 for Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Viaarxiv icon

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Dec 02, 2022
Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu

Figure 1 for SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Figure 2 for SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Figure 3 for SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Figure 4 for SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Viaarxiv icon

VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing

Add code
Bookmark button
Alert button
Nov 30, 2022
Yihan Wu, Junliang Guo, Xu Tan, Chen Zhang, Bohan Li, Ruihua Song, Lei He, Sheng Zhao, Arul Menezes, Jiang Bian

Figure 1 for VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
Figure 2 for VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
Figure 3 for VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
Figure 4 for VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
Viaarxiv icon

Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction

Add code
Bookmark button
Alert button
Nov 23, 2022
Kai Shen, Yichong Leng, Xu Tan, Siliang Tang, Yuan Zhang, Wenjie Liu, Edward Lin

Figure 1 for Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction
Figure 2 for Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction
Figure 3 for Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction
Figure 4 for Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction
Viaarxiv icon

PromptTTS: Controllable Text-to-Speech with Text Descriptions

Add code
Bookmark button
Alert button
Nov 22, 2022
Zhifang Guo, Yichong Leng, Yihan Wu, Sheng Zhao, Xu Tan

Figure 1 for PromptTTS: Controllable Text-to-Speech with Text Descriptions
Figure 2 for PromptTTS: Controllable Text-to-Speech with Text Descriptions
Figure 3 for PromptTTS: Controllable Text-to-Speech with Text Descriptions
Figure 4 for PromptTTS: Controllable Text-to-Speech with Text Descriptions
Viaarxiv icon

Towards Understanding Omission in Dialogue Summarization

Add code
Bookmark button
Alert button
Nov 14, 2022
Yicheng Zou, Kaitao Song, Xu Tan, Zhongkai Fu, Tao Gui, Qi Zhang, Dongsheng Li

Figure 1 for Towards Understanding Omission in Dialogue Summarization
Figure 2 for Towards Understanding Omission in Dialogue Summarization
Figure 3 for Towards Understanding Omission in Dialogue Summarization
Figure 4 for Towards Understanding Omission in Dialogue Summarization
Viaarxiv icon