Alert button
Picture for Wenwu Wang

Wenwu Wang

Alert button

WavJourney: Compositional Audio Creation with Large Language Models

Add code
Bookmark button
Alert button
Jul 26, 2023
Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang

Viaarxiv icon

Exploring the Potential of Integrated Optical Sensing and Communication (IOSAC) Systems with Si Waveguides for Future Networks

Add code
Bookmark button
Alert button
Jun 27, 2023
Xiangpeng Ou, Ying Qiu, Ming Luo, Fujun Sun, Peng Zhang, Gang Yang, Junjie Li, Jianfeng Gao, Xiaobin He, Anyan Du, Bo Tang, Bin Li, Zichen Liu, Zhihua Li, Ling Xie, Xi Xiao, Jun Luo, Wenwu Wang, Jin Tao, Yan Yang

Figure 1 for Exploring the Potential of Integrated Optical Sensing and Communication (IOSAC) Systems with Si Waveguides for Future Networks
Figure 2 for Exploring the Potential of Integrated Optical Sensing and Communication (IOSAC) Systems with Si Waveguides for Future Networks
Figure 3 for Exploring the Potential of Integrated Optical Sensing and Communication (IOSAC) Systems with Si Waveguides for Future Networks
Figure 4 for Exploring the Potential of Integrated Optical Sensing and Communication (IOSAC) Systems with Si Waveguides for Future Networks
Viaarxiv icon

Text-Driven Foley Sound Generation With Latent Diffusion Model

Add code
Bookmark button
Alert button
Jun 23, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang

Figure 1 for Text-Driven Foley Sound Generation With Latent Diffusion Model
Figure 2 for Text-Driven Foley Sound Generation With Latent Diffusion Model
Figure 3 for Text-Driven Foley Sound Generation With Latent Diffusion Model
Figure 4 for Text-Driven Foley Sound Generation With Latent Diffusion Model
Viaarxiv icon

Knowledge Distillation for Efficient Audio-Visual Video Captioning

Add code
Bookmark button
Alert button
Jun 16, 2023
Özkan Çaylı, Xubo Liu, Volkan Kılıç, Wenwu Wang

Figure 1 for Knowledge Distillation for Efficient Audio-Visual Video Captioning
Figure 2 for Knowledge Distillation for Efficient Audio-Visual Video Captioning
Figure 3 for Knowledge Distillation for Efficient Audio-Visual Video Captioning
Viaarxiv icon

Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning

Add code
Bookmark button
Alert button
May 30, 2023
Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kılıç, Mark D. Plumbley, Wenwu Wang

Figure 1 for Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Figure 2 for Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Figure 3 for Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Figure 4 for Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Viaarxiv icon

Adapting Language-Audio Models as Few-Shot Audio Learners

Add code
Bookmark button
Alert button
May 28, 2023
Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang

Figure 1 for Adapting Language-Audio Models as Few-Shot Audio Learners
Figure 2 for Adapting Language-Audio Models as Few-Shot Audio Learners
Figure 3 for Adapting Language-Audio Models as Few-Shot Audio Learners
Figure 4 for Adapting Language-Audio Models as Few-Shot Audio Learners
Viaarxiv icon

Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7

Add code
Bookmark button
Alert button
May 25, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang

Figure 1 for Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7
Figure 2 for Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7
Figure 3 for Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7
Viaarxiv icon

Time-weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection

Add code
Bookmark button
Alert button
May 05, 2023
Jian Guan, Youde Liu, Qiaoxi Zhu, Tieran Zheng, Jiqing Han, Wenwu Wang

Figure 1 for Time-weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection
Figure 2 for Time-weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection
Figure 3 for Time-weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection
Figure 4 for Time-weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection
Viaarxiv icon

Graph Attention for Automated Audio Captioning

Add code
Bookmark button
Alert button
Apr 10, 2023
Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Wenwu Wang

Figure 1 for Graph Attention for Automated Audio Captioning
Figure 2 for Graph Attention for Automated Audio Captioning
Figure 3 for Graph Attention for Automated Audio Captioning
Viaarxiv icon