Alert button
Picture for Huadai Liu

Huadai Liu

Alert button

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation

Add code
Bookmark button
Alert button
May 24, 2023
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao

Figure 1 for AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
Figure 2 for AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
Figure 3 for AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
Figure 4 for AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
Viaarxiv icon

ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer

Add code
Bookmark button
Alert button
May 22, 2023
Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao

Figure 1 for ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Figure 2 for ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Figure 3 for ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Figure 4 for ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Viaarxiv icon

Wav2SQL: Direct Generalizable Speech-To-SQL Parsing

Add code
Bookmark button
Alert button
May 21, 2023
Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao

Figure 1 for Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Figure 2 for Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Figure 3 for Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Figure 4 for Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Viaarxiv icon

RMSSinger: Realistic-Music-Score based Singing Voice Synthesis

Add code
Bookmark button
Alert button
May 18, 2023
Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao

Figure 1 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 2 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 3 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 4 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Viaarxiv icon

ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech

Add code
Bookmark button
Alert button
Jul 13, 2022
Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren

Figure 1 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 2 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 3 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 4 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Viaarxiv icon

TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation

Add code
Bookmark button
Alert button
May 25, 2022
Rongjie Huang, Zhou Zhao, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He

Figure 1 for TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
Figure 2 for TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
Figure 3 for TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
Figure 4 for TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
Viaarxiv icon