Qiuqiang Kong

A unified front-end framework for English text-to-speech synthesis

May 18, 2023
Zelin Ying, Chen Li, Yu Dong, Qiuqiang Kong, YuanYuan Huo, Yuping Wang, Yuxuan Wang

Multi-level Temporal-channel Speaker Retrieval for Robust Zero-shot Voice Conversion

May 12, 2023
Zhichao Wang, Liumeng Xue, Qiuqiang Kong, Lei Xie, Yuanzhe Chen, Qiao Tian, Yuping Wang

Universal Source Separation with Weakly Labelled Data

May 11, 2023
Qiuqiang Kong, Ke Chen, Haohe Liu, Xingjian Du, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Mark D. Plumbley

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

Mar 30, 2023
Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang

Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training

Feb 02, 2023
Kin Wai Cheuk, Keunwoo Choi, Qiuqiang Kong, Bochen Li, Minz Won, Ju-Chiang Wang, Yun-Ning Hung, Dorien Herremans

Ontology-aware Learning and Evaluation for Audio Tagging

Nov 22, 2022
Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley

Binaural Rendering of Ambisonic Signals by Neural Networks

Nov 04, 2022
Yin Zhu, Qiuqiang Kong, Junjie Shi, Shilei Liu, Xuzhou Ye, Ju-Chiang Wang, Junping Zhang

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention

Oct 28, 2022
Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, Lilian H. Tang, Mark D. Plumbley, Volkan Kılıç, Wenwu Wang

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

Oct 27, 2022
Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang

Simple Pooling Front-ends For Efficient Audio Classification

Oct 07, 2022
Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang
