Haohe Liu

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies

Aug 03, 2023
Ke Chen, Yusong Wu, Haohe Liu, Marianna Nezhurina, Taylor Berg-Kirkpatrick, Shlomo Dubnov

Via arXiv

WavJourney: Compositional Audio Creation with Large Language Models

Jul 26, 2023
Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang

Via arXiv

Text-Driven Foley Sound Generation With Latent Diffusion Model

Jun 23, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang

Via arXiv

E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural Networks

May 30, 2023
Arshdeep Singh, Haohe Liu, Mark D. Plumbley

Via arXiv

Adapting Language-Audio Models as Few-Shot Audio Learners

May 28, 2023
Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang

Via arXiv

Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7

May 25, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang

Via arXiv

Learning to detect an animal sound from five examples

May 22, 2023
Inês Nolasco, Shubhr Singh, Veronica Morfi, Vincent Lostanlen, Ariana Strandburg-Peshkin, Ester Vidaña-Vila, Lisa Gill, Hanna Pamuła, Helen Whitehead, Ivan Kiskin, Frants H. Jensen, Joe Morford, Michael G. Emmerson, Elisabetta Versace, Emily Grout, Haohe Liu, Dan Stowell

Via arXiv

Universal Source Separation with Weakly Labelled Data

May 11, 2023
Qiuqiang Kong, Ke Chen, Haohe Liu, Xingjian Du, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Mark D. Plumbley

Via arXiv

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

Mar 30, 2023
Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang

Via arXiv