Alert button
Picture for Sheng Zhao

Sheng Zhao

Alert button

VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer

Add code
Bookmark button
Alert button
Aug 11, 2023
Liyang Chen, Zhiyong Wu, Runnan Li, Weihong Bao, Jun Ling, Xu Tan, Sheng Zhao

Figure 1 for VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer
Figure 2 for VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer
Figure 3 for VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer
Figure 4 for VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer
Viaarxiv icon

The detection and rectification for identity-switch based on unfalsified control

Add code
Bookmark button
Alert button
Jul 27, 2023
Junchao Huang, Xiaoqi He, Sheng Zhao

Figure 1 for The detection and rectification for identity-switch based on unfalsified control
Figure 2 for The detection and rectification for identity-switch based on unfalsified control
Figure 3 for The detection and rectification for identity-switch based on unfalsified control
Figure 4 for The detection and rectification for identity-switch based on unfalsified control
Viaarxiv icon

ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading

Add code
Bookmark button
Alert button
Jul 03, 2023
Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee

Figure 1 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 2 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 3 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 4 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Viaarxiv icon

An End-to-End Multi-Module Audio Deepfake Generation System for ADD Challenge 2023

Add code
Bookmark button
Alert button
Jul 03, 2023
Sheng Zhao, Qilong Yuan, Yibo Duan, Zhuoyue Chen

Figure 1 for An End-to-End Multi-Module Audio Deepfake Generation System for ADD Challenge 2023
Figure 2 for An End-to-End Multi-Module Audio Deepfake Generation System for ADD Challenge 2023
Figure 3 for An End-to-End Multi-Module Audio Deepfake Generation System for ADD Challenge 2023
Figure 4 for An End-to-End Multi-Module Audio Deepfake Generation System for ADD Challenge 2023
Viaarxiv icon

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

Add code
Bookmark button
Alert button
May 04, 2023
Kai Shen, Zeqian Ju, Xu Tan, Yanqing Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian

Figure 1 for NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Figure 2 for NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Figure 3 for NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Figure 4 for NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Viaarxiv icon

DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder

Add code
Bookmark button
Alert button
Apr 23, 2023
Chenpng Du, Qi Chen, Tianyu He, Xu Tan, Xie Chen, Kai Yu, Sheng Zhao, Jiang Bian

Figure 1 for DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder
Figure 2 for DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder
Figure 3 for DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder
Figure 4 for DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder
Viaarxiv icon

AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models

Add code
Bookmark button
Alert button
Apr 05, 2023
Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao

Figure 1 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 2 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 3 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Figure 4 for AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Viaarxiv icon