Alert button
Picture for Zhiyong Wu

Zhiyong Wu

Alert button

Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation

Add code
Bookmark button
Alert button
Jan 15, 2024
Zhiwei Lin, Jun Chen, Boshi Tang, Binzhu Sha, Jing Yang, Yaolong Ju, Fan Fan, Shiyin Kang, Zhiyong Wu, Helen Meng

Viaarxiv icon

Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness

Add code
Bookmark button
Alert button
Jan 07, 2024
Sicheng Yang, Zunnan Xu, Haiwei Xue, Yongkang Cheng, Shaoli Huang, Mingming Gong, Zhiyong Wu

Viaarxiv icon

Consistent and Relevant: Rethink the Query Embedding in General Sound Separation

Add code
Bookmark button
Alert button
Dec 24, 2023
Yuanyuan Wang, Hangting Chen, Dongchao Yang, Jianwei Yu, Chao Weng, Zhiyong Wu, Helen Meng

Viaarxiv icon

SECap: Speech Emotion Captioning with Large Language Model

Add code
Bookmark button
Alert button
Dec 23, 2023
Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shixiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu

Viaarxiv icon

StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis

Add code
Bookmark button
Alert button
Dec 19, 2023
Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng

Figure 1 for StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Figure 2 for StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Figure 3 for StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Figure 4 for StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Viaarxiv icon

SimCalib: Graph Neural Network Calibration based on Similarity between Nodes

Add code
Bookmark button
Alert button
Dec 19, 2023
Boshi Tang, Zhiyong Wu, Xixin Wu, Qiaochu Huang, Jun Chen, Shun Lei, Helen Meng

Viaarxiv icon

Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations

Add code
Bookmark button
Alert button
Dec 18, 2023
Zilin Wang, Haolin Zhuang, Lu Li, Yinmin Zhang, Junjie Zhong, Jun Chen, Yu Yang, Boshi Tang, Zhiyong Wu

Viaarxiv icon

Stable Score Distillation for High-Quality 3D Generation

Add code
Bookmark button
Alert button
Dec 14, 2023
Boshi Tang, Jianan Wang, Zhiyong Wu, Lei Zhang

Figure 1 for Stable Score Distillation for High-Quality 3D Generation
Figure 2 for Stable Score Distillation for High-Quality 3D Generation
Figure 3 for Stable Score Distillation for High-Quality 3D Generation
Figure 4 for Stable Score Distillation for High-Quality 3D Generation
Viaarxiv icon

neural concatenative singing voice conversion: rethinking concatenation-based approach for one-shot singing voice conversion

Add code
Bookmark button
Alert button
Dec 08, 2023
Binzhu Sha, Xu Li, Zhiyong Wu, Ying Shan, Helen Meng

Viaarxiv icon

Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Add code
Bookmark button
Alert button
Nov 15, 2023
Fangzhi Xu, Zhiyong Wu, Qiushi Sun, Siyu Ren, Fei Yuan, Shuai Yuan, Qika Lin, Yu Qiao, Jun Liu

Viaarxiv icon