Alert button
Picture for Xubo Liu

Xubo Liu

Alert button

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

Nov 30, 2023
Jinhua Liang, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Viaarxiv icon

Audio Prompt Tuning for Universal Sound Separation

Nov 30, 2023
Yuzhuo Liu, Xubo Liu, Yan Zhao, Yuanyuan Wang, Rui Xia, Pingchuan Tain, Yuxuan Wang

Viaarxiv icon

First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation

Oct 22, 2023
Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang

Viaarxiv icon

CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing

Oct 11, 2023
Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang

Viaarxiv icon

Synth-AC: Enhancing Audio Captioning with Synthetic Supervision

Sep 18, 2023
Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Xubo Liu, Haohe Liu, Kejia Zhang, Wenwu Wang

Figure 1 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 2 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 3 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 4 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Viaarxiv icon

Retrieval-Augmented Text-to-Audio Generation

Sep 14, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang

Figure 1 for Retrieval-Augmented Text-to-Audio Generation
Figure 2 for Retrieval-Augmented Text-to-Audio Generation
Figure 3 for Retrieval-Augmented Text-to-Audio Generation
Figure 4 for Retrieval-Augmented Text-to-Audio Generation
Viaarxiv icon

Multimodal Fish Feeding Intensity Assessment in Aquaculture

Sep 10, 2023
Meng Cui, Xubo Liu, Haohe Liu, Zhuangzhuang Du, Tao Chen, Guoping Lian, Daoliang Li, Wenwu Wang

Figure 1 for Multimodal Fish Feeding Intensity Assessment in Aquaculture
Figure 2 for Multimodal Fish Feeding Intensity Assessment in Aquaculture
Figure 3 for Multimodal Fish Feeding Intensity Assessment in Aquaculture
Figure 4 for Multimodal Fish Feeding Intensity Assessment in Aquaculture
Viaarxiv icon

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Aug 10, 2023
Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley

Figure 1 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 2 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 3 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 4 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Viaarxiv icon

Separate Anything You Describe

Aug 09, 2023
Xubo Liu, Qiuqiang Kong, Yan Zhao, Haohe Liu, Yi Yuan, Yuzhuo Liu, Rui Xia, Yuxuan Wang, Mark D. Plumbley, Wenwu Wang

Figure 1 for Separate Anything You Describe
Figure 2 for Separate Anything You Describe
Figure 3 for Separate Anything You Describe
Figure 4 for Separate Anything You Describe
Viaarxiv icon

WavJourney: Compositional Audio Creation with Large Language Models

Jul 26, 2023
Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang

Viaarxiv icon