Alert button
Picture for Wenwu Wang

Wenwu Wang

Alert button

Anomalous Sound Detection using Audio Representation with Machine ID based Contrastive Learning Pretraining

Add code
Bookmark button
Alert button
Apr 10, 2023
Jian Guan, Feiyang Xiao, Youde Liu, Qiaoxi Zhu, Wenwu Wang

Figure 1 for Anomalous Sound Detection using Audio Representation with Machine ID based Contrastive Learning Pretraining
Figure 2 for Anomalous Sound Detection using Audio Representation with Machine ID based Contrastive Learning Pretraining
Figure 3 for Anomalous Sound Detection using Audio Representation with Machine ID based Contrastive Learning Pretraining
Figure 4 for Anomalous Sound Detection using Audio Representation with Machine ID based Contrastive Learning Pretraining
Viaarxiv icon

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

Add code
Bookmark button
Alert button
Mar 30, 2023
Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang

Figure 1 for WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Figure 2 for WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Figure 3 for WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Figure 4 for WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Viaarxiv icon

Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study

Add code
Bookmark button
Alert button
Mar 11, 2023
Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang

Figure 1 for Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study
Figure 2 for Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study
Figure 3 for Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study
Figure 4 for Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study
Viaarxiv icon

Differentiable Bootstrap Particle Filters for Regime-Switching Models

Add code
Bookmark button
Alert button
Feb 20, 2023
Wenhan Li, Xiongjie Chen, Wenwu Wang, Víctor Elvira, Yunpeng Li

Figure 1 for Differentiable Bootstrap Particle Filters for Regime-Switching Models
Figure 2 for Differentiable Bootstrap Particle Filters for Regime-Switching Models
Figure 3 for Differentiable Bootstrap Particle Filters for Regime-Switching Models
Figure 4 for Differentiable Bootstrap Particle Filters for Regime-Switching Models
Viaarxiv icon

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models

Add code
Bookmark button
Alert button
Feb 16, 2023
Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D. Plumbley

Figure 1 for AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Figure 2 for AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Figure 3 for AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Figure 4 for AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Viaarxiv icon

Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN

Add code
Bookmark button
Alert button
Dec 23, 2022
Yaozong Mo, Chaofeng Li, Wenqi Ren, Shaopeng Shang, Wenwu Wang, Xiao-jun Wu

Figure 1 for Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN
Figure 2 for Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN
Figure 3 for Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN
Figure 4 for Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN
Viaarxiv icon

Towards Generating Diverse Audio Captions via Adversarial Training

Add code
Bookmark button
Alert button
Dec 05, 2022
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang

Figure 1 for Towards Generating Diverse Audio Captions via Adversarial Training
Figure 2 for Towards Generating Diverse Audio Captions via Adversarial Training
Figure 3 for Towards Generating Diverse Audio Captions via Adversarial Training
Figure 4 for Towards Generating Diverse Audio Captions via Adversarial Training
Viaarxiv icon

ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation

Add code
Bookmark button
Alert button
Nov 23, 2022
Sara Atito, Muhammad Awais, Wenwu Wang, Mark D Plumbley, Josef Kittler

Figure 1 for ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
Figure 2 for ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
Figure 3 for ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
Figure 4 for ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
Viaarxiv icon

Ontology-aware Learning and Evaluation for Audio Tagging

Add code
Bookmark button
Alert button
Nov 22, 2022
Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley

Figure 1 for Ontology-aware Learning and Evaluation for Audio Tagging
Figure 2 for Ontology-aware Learning and Evaluation for Audio Tagging
Figure 3 for Ontology-aware Learning and Evaluation for Audio Tagging
Figure 4 for Ontology-aware Learning and Evaluation for Audio Tagging
Viaarxiv icon