Zhou Zhao

MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech

Feb 14, 2024
Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Feb 12, 2024
Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis

Jan 20, 2024
Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis

Jan 02, 2024
Yu Zhang, Rongjie Huang, Ruiqi Li, JinZheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao

TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation

Dec 23, 2023
Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, Changpeng Yang, Zhou Zhao

Language Model is a Branch Predictor for Simultaneous Machine Translation

Dec 22, 2023
Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao

Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding

Dec 21, 2023
Haifeng Huang, Yang Zhao, Zehan Wang, Yan Xia, Zhou Zhao

StyleSinger: Style Transfer for Out-Of-Domain Singing Voice Synthesis

Dec 17, 2023
Yu Zhang, Rongjie Huang, Ruiqi Li, JinZheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao

Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers

Dec 15, 2023
Haifeng Huang, Zehan Wang, Rongjie Huang, Luping Liu, Xize Cheng, Yang Zhao, Tao Jin, Zhou Zhao

Music-PAW: Learning Music Representations via Hierarchical Part-whole Interaction and Contrast

Dec 11, 2023
Dong Yao, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Liqun Deng, Wenqiao Zhang, Zhenhua Dong, Ruiming Tang, Xin Jiang
