Alert button
Picture for Rongjie Huang

Rongjie Huang

Alert button

Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment

Add code
Bookmark button
Alert button
Apr 16, 2024
Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang

Viaarxiv icon

3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization

Add code
Bookmark button
Alert button
Mar 29, 2024
Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Tinglong Zhu, Changhe Song, Rongjie Huang, Ziyang Ma, Qian Chen, Shiliang Zhang, Xihao Li

Figure 1 for 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization
Figure 2 for 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization
Figure 3 for 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization
Figure 4 for 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization
Viaarxiv icon

Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt

Add code
Bookmark button
Alert button
Mar 18, 2024
Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao

Figure 1 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 2 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 3 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 4 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Viaarxiv icon

Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

Add code
Bookmark button
Alert button
Feb 20, 2024
Shengpeng Ji, Minghui Fang, Ziyue Jiang, Rongjie Huang, Jialung Zuo, Shulei Wang, Zhou Zhao

Viaarxiv icon

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis

Add code
Bookmark button
Alert button
Jan 20, 2024
Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao

Viaarxiv icon

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis

Add code
Bookmark button
Alert button
Jan 02, 2024
Yu Zhang, Rongjie Huang, Ruiqi Li, JinZheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao

Viaarxiv icon

TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation

Add code
Bookmark button
Alert button
Dec 23, 2023
Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, changpeng yang, Zhou Zhao

Viaarxiv icon

StyleSinger: Style Transfer for Out-Of-Domain Singing Voice Synthesis

Add code
Bookmark button
Alert button
Dec 17, 2023
Yu Zhang, Rongjie Huang, Ruiqi Li, JinZheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao

Viaarxiv icon

Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers

Add code
Bookmark button
Alert button
Dec 15, 2023
Haifeng Huang, Zehan Wang, Rongjie Huang, Luping Liu, Xize Cheng, Yang Zhao, Tao Jin, Zhou Zhao

Figure 1 for Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers
Figure 2 for Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers
Figure 3 for Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers
Figure 4 for Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers
Viaarxiv icon

UniAudio: An Audio Foundation Model Toward Universal Audio Generation

Add code
Bookmark button
Alert button
Oct 11, 2023
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng

Figure 1 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 2 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 3 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Figure 4 for UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Viaarxiv icon