Alert button
Picture for Wenwu Wang

Wenwu Wang

Alert button

Synth-AC: Enhancing Audio Captioning with Synthetic Supervision

Add code
Bookmark button
Alert button
Sep 18, 2023
Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Xubo Liu, Haohe Liu, Kejia Zhang, Wenwu Wang

Figure 1 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 2 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 3 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 4 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Viaarxiv icon

Retrieval-Augmented Text-to-Audio Generation

Add code
Bookmark button
Alert button
Sep 14, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang

Figure 1 for Retrieval-Augmented Text-to-Audio Generation
Figure 2 for Retrieval-Augmented Text-to-Audio Generation
Figure 3 for Retrieval-Augmented Text-to-Audio Generation
Figure 4 for Retrieval-Augmented Text-to-Audio Generation
Viaarxiv icon

Hierarchical Metadata Information Constrained Self-Supervised Learning for Anomalous Sound Detection Under Domain Shift

Add code
Bookmark button
Alert button
Sep 14, 2023
Haiyan Lan, Qiaoxi Zhu, Jian Guan, Yuming Wei, Wenwu Wang

Viaarxiv icon

AudioSR: Versatile Audio Super-resolution at Scale

Add code
Bookmark button
Alert button
Sep 13, 2023
Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley

Figure 1 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 2 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 3 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 4 for AudioSR: Versatile Audio Super-resolution at Scale
Viaarxiv icon

Multimodal Fish Feeding Intensity Assessment in Aquaculture

Add code
Bookmark button
Alert button
Sep 10, 2023
Meng Cui, Xubo Liu, Haohe Liu, Zhuangzhuang Du, Tao Chen, Guoping Lian, Daoliang Li, Wenwu Wang

Figure 1 for Multimodal Fish Feeding Intensity Assessment in Aquaculture
Figure 2 for Multimodal Fish Feeding Intensity Assessment in Aquaculture
Figure 3 for Multimodal Fish Feeding Intensity Assessment in Aquaculture
Figure 4 for Multimodal Fish Feeding Intensity Assessment in Aquaculture
Viaarxiv icon

Sparks of Large Audio Models: A Survey and Outlook

Add code
Bookmark button
Alert button
Sep 03, 2023
Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Björn W. Schuller

Figure 1 for Sparks of Large Audio Models: A Survey and Outlook
Figure 2 for Sparks of Large Audio Models: A Survey and Outlook
Figure 3 for Sparks of Large Audio Models: A Survey and Outlook
Figure 4 for Sparks of Large Audio Models: A Survey and Outlook
Viaarxiv icon

Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning

Add code
Bookmark button
Alert button
Aug 23, 2023
Yuanbo Hou, Siyang Song, Cheng Luo, Andrew Mitchell, Qiaoqiao Ren, Weicheng Xie, Jian Kang, Wenwu Wang, Dick Botteldooren

Figure 1 for Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning
Figure 2 for Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning
Figure 3 for Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning
Figure 4 for Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning
Viaarxiv icon

META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection

Add code
Bookmark button
Alert button
Aug 17, 2023
Jinbo Hu, Yin Cao, Ming Wu, Feiran Yang, Ziying Yu, Wenwu Wang, Mark D. Plumbley, Jun Yang

Figure 1 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Figure 2 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Figure 3 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Viaarxiv icon

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Add code
Bookmark button
Alert button
Aug 10, 2023
Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley

Figure 1 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 2 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 3 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 4 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Viaarxiv icon

Separate Anything You Describe

Add code
Bookmark button
Alert button
Aug 09, 2023
Xubo Liu, Qiuqiang Kong, Yan Zhao, Haohe Liu, Yi Yuan, Yuzhuo Liu, Rui Xia, Yuxuan Wang, Mark D. Plumbley, Wenwu Wang

Figure 1 for Separate Anything You Describe
Figure 2 for Separate Anything You Describe
Figure 3 for Separate Anything You Describe
Figure 4 for Separate Anything You Describe
Viaarxiv icon