Alert button
Picture for Jinzheng He

Jinzheng He

Alert button

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis

Add code
Bookmark button
Alert button
Jan 20, 2024
Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao

Viaarxiv icon

Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts

Add code
Bookmark button
Alert button
Jul 14, 2023
Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

Figure 1 for Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Figure 2 for Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Figure 3 for Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Figure 4 for Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Viaarxiv icon

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation

Add code
Bookmark button
Alert button
May 24, 2023
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao

Figure 1 for AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
Figure 2 for AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
Figure 3 for AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
Figure 4 for AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
Viaarxiv icon

ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer

Add code
Bookmark button
Alert button
May 22, 2023
Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao

Figure 1 for ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Figure 2 for ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Figure 3 for ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Figure 4 for ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Viaarxiv icon

Wav2SQL: Direct Generalizable Speech-To-SQL Parsing

Add code
Bookmark button
Alert button
May 21, 2023
Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao

Figure 1 for Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Figure 2 for Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Figure 3 for Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Figure 4 for Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Viaarxiv icon

CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training

Add code
Bookmark button
Alert button
May 18, 2023
Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao

Figure 1 for CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training
Figure 2 for CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training
Figure 3 for CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training
Figure 4 for CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training
Viaarxiv icon

RMSSinger: Realistic-Music-Score based Singing Voice Synthesis

Add code
Bookmark button
Alert button
May 18, 2023
Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao

Figure 1 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 2 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 3 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 4 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Viaarxiv icon

GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation

Add code
Bookmark button
Alert button
May 01, 2023
Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao

Figure 1 for GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Figure 2 for GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Figure 3 for GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Figure 4 for GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Viaarxiv icon

TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation

Add code
Bookmark button
Alert button
May 25, 2022
Rongjie Huang, Zhou Zhao, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He

Figure 1 for TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
Figure 2 for TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
Figure 3 for TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
Figure 4 for TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
Viaarxiv icon

PopMAG: Pop Music Accompaniment Generation

Add code
Bookmark button
Alert button
Aug 18, 2020
Yi Ren, Jinzheng He, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu

Figure 1 for PopMAG: Pop Music Accompaniment Generation
Figure 2 for PopMAG: Pop Music Accompaniment Generation
Figure 3 for PopMAG: Pop Music Accompaniment Generation
Figure 4 for PopMAG: Pop Music Accompaniment Generation
Viaarxiv icon