Yichong Leng

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Mar 05, 2024
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao

Via arXiv

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Feb 12, 2024
Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou

Via arXiv

PromptTTS 2: Describing and Generating Voices with Text Prompt

Sep 05, 2023
Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian

Via arXiv

Extract and Attend: Improving Entity Translation in Neural Machine Translation

Jun 04, 2023
Zixin Zeng, Rui Wang, Yichong Leng, Junliang Guo, Xu Tan, Tao Qin, Tie-Yan Liu

Via arXiv

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

May 04, 2023
Kai Shen, Zeqian Ju, Xu Tan, Yanqing Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian

Via arXiv

ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech

Dec 30, 2022
Zehua Chen, Yihan Wu, Yichong Leng, Jiawei Chen, Haohe Liu, Xu Tan, Yang Cui, Ke Wang, Lei He, Sheng Zhao, Jiang Bian, Danilo Mandic

Via arXiv

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition

Dec 02, 2022
Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu

Via arXiv

Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction

Nov 23, 2022
Kai Shen, Yichong Leng, Xu Tan, Siliang Tang, Yuan Zhang, Wenjie Liu, Edward Lin

Via arXiv

PromptTTS: Controllable Text-to-Speech with Text Descriptions

Nov 22, 2022
Zhifang Guo, Yichong Leng, Yihan Wu, Sheng Zhao, Xu Tan

Via arXiv