Yanqing Liu


NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Mar 05, 2024
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao

Via arXiv

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Feb 12, 2024
Naoyuki Kanda, Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Hemin Yang, Zirun Zhu, Min Tang, Canrun Li, Steven Tsai, Zhen Xiao, Yufei Xia, Jinzhu Li, Yanqing Liu, Sheng Zhao, Michael Zeng

Via arXiv

MLLMs-Augmented Visual-Language Representation Learning

Dec 01, 2023
Yanqing Liu, Kai Wang, Wenqi Shao, Ping Luo, Yu Qiao, Mike Zheng Shou, Kaipeng Zhang, Yang You

Via arXiv

DREAM+: Efficient Dataset Distillation by Bidirectional Representative Matching

Oct 23, 2023
Yanqing Liu, Jianyang Gu, Kai Wang, Zheng Zhu, Kaipeng Zhang, Wei Jiang, Yang You

Via arXiv

PromptTTS 2: Describing and Generating Voices with Text Prompt

Sep 05, 2023
Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian

Via arXiv

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

May 04, 2023
Kai Shen, Zeqian Ju, Xu Tan, Yanqing Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian

Via arXiv

DREAM: Efficient Dataset Distillation by Representative Matching

Mar 09, 2023
Yanqing Liu, Jianyang Gu, Kai Wang, Zheng Zhu, Wei Jiang, Yang You

Via arXiv

FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model

Mar 08, 2023
Ruiqing Xue, Yanqing Liu, Lei He, Xu Tan, Linquan Liu, Edward Lin, Sheng Zhao

Via arXiv