Alert button
Picture for Zhou Zhao

Zhou Zhao

Alert button

Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment

Add code
Bookmark button
Alert button
Apr 16, 2024
Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang

Viaarxiv icon

Multimodal Pretraining, Adaptation, and Generation for Recommendation: A Survey

Add code
Bookmark button
Alert button
Mar 31, 2024
Qijiong Liu, Jieming Zhu, Yanting Yang, Quanyu Dai, Zhaocheng Du, Xiao-Ming Wu, Zhou Zhao, Rui Zhang, Zhenhua Dong

Viaarxiv icon

MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation

Add code
Bookmark button
Alert button
Mar 19, 2024
Haoyu Zhao, Wenhui Dong, Rui Yu, Zhou Zhao, Du Bo, Yongchao Xu

Figure 1 for MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation
Figure 2 for MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation
Figure 3 for MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation
Figure 4 for MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation
Viaarxiv icon

WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising

Add code
Bookmark button
Alert button
Mar 19, 2024
Haoyu Zhao, Yuliang Gu, Zhou Zhao, Bo Du, Yongchao Xu, Rui Yu

Figure 1 for WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising
Figure 2 for WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising
Figure 3 for WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising
Figure 4 for WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising
Viaarxiv icon

Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt

Add code
Bookmark button
Alert button
Mar 18, 2024
Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao

Figure 1 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 2 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 3 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 4 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Viaarxiv icon

Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment

Add code
Bookmark button
Alert button
Mar 08, 2024
Hai Huang, Yan Xia, Shengpeng Ji, Shulei Wang, Hanting Wang, Jieming Zhu, Zhenhua Dong, Zhou Zhao

Figure 1 for Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment
Figure 2 for Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment
Figure 3 for Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment
Figure 4 for Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment
Viaarxiv icon

A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching

Add code
Bookmark button
Alert button
Mar 05, 2024
Dong Yao, Asaad Alghamdi, Qingrong Xia, Xiaoye Qu, Xinyu Duan, Zhefeng Wang, Yi Zheng, Baoxing Huai, Peilun Cheng, Zhou Zhao

Figure 1 for A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching
Figure 2 for A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching
Figure 3 for A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching
Figure 4 for A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching
Viaarxiv icon

Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

Add code
Bookmark button
Alert button
Feb 20, 2024
Shengpeng Ji, Minghui Fang, Ziyue Jiang, Rongjie Huang, Jialung Zuo, Shulei Wang, Zhou Zhao

Viaarxiv icon

MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech

Add code
Bookmark button
Alert button
Feb 14, 2024
Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao

Viaarxiv icon

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Add code
Bookmark button
Alert button
Feb 12, 2024
Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou

Viaarxiv icon