Zhou Zhao

MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer

Jun 12, 2023
Yazheng Yang, Zhou Zhao, Qi Liu

OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment

Jun 10, 2023
Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao

Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias

Jun 06, 2023
Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis

Jun 06, 2023
Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao

Detector Guidance for Multi-Object Text-to-Image Generation

Jun 04, 2023
Luping Liu, Zijian Zhang, Yi Ren, Rongjie Huang, Xiang Yin, Zhou Zhao

Make-A-Voice: Unified Voice Synthesis With Discrete Representation

May 30, 2023
Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Luping Liu, Zhenhui Ye, Ziyue Jiang, Chao Weng, Zhou Zhao, Dong Yu

Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation

May 29, 2023
Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation

May 24, 2023
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao

AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment

May 24, 2023
Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao

FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models

May 23, 2023
Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao
