Alert button
Picture for Xiang-Yang Li

Xiang-Yang Li

Alert button

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Add code
Bookmark button
Alert button
Mar 05, 2024
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao

Figure 1 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 2 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 3 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 4 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Viaarxiv icon

PromptTTS 2: Describing and Generating Voices with Text Prompt

Add code
Bookmark button
Alert button
Sep 05, 2023
Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian

Figure 1 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 2 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 3 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 4 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Viaarxiv icon

Tight Memory-Regret Lower Bounds for Streaming Bandits

Add code
Bookmark button
Alert button
Jun 13, 2023
Shaoang Li, Lan Zhang, Junhao Wang, Xiang-Yang Li

Figure 1 for Tight Memory-Regret Lower Bounds for Streaming Bandits
Viaarxiv icon

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Dec 02, 2022
Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu

Figure 1 for SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Figure 2 for SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Figure 3 for SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Figure 4 for SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Viaarxiv icon

Data Provenance Inference in Machine Learning

Add code
Bookmark button
Alert button
Nov 24, 2022
Mingxue Xu, Xiang-Yang Li

Viaarxiv icon

MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference

Add code
Bookmark button
Alert button
Sep 28, 2022
Mu Yuan, Lan Zhang, Zimu Zheng, Yi-Nan Zhang, Xiang-Yang Li

Figure 1 for MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Figure 2 for MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Figure 3 for MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Figure 4 for MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Viaarxiv icon

InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference

Add code
Bookmark button
Alert button
Sep 28, 2022
Mu Yuan, Lan Zhang, Fengxiang He, Xueting Tong, Miao-Hui Song, Xiang-Yang Li

Figure 1 for InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Figure 2 for InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Figure 3 for InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Figure 4 for InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Viaarxiv icon

BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis

Add code
Bookmark button
Alert button
May 30, 2022
Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu

Figure 1 for BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Figure 2 for BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Figure 3 for BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Figure 4 for BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Viaarxiv icon

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Oct 18, 2021
Yichong Leng, Xu Tan, Rui Wang, Linchen Zhu, Jin Xu, Wenjie Liu, Linquan Liu, Tao Qin, Xiang-Yang Li, Edward Lin, Tie-Yan Liu

Figure 1 for FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Figure 2 for FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Figure 3 for FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Figure 4 for FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Viaarxiv icon