Alert button
Picture for Jianyuan Sun

Jianyuan Sun

Alert button

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Add code
Bookmark button
Alert button
Oct 23, 2023
Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang

Figure 1 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 2 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 3 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 4 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Viaarxiv icon

Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning

Add code
Bookmark button
Alert button
May 30, 2023
Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kılıç, Mark D. Plumbley, Wenwu Wang

Figure 1 for Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Figure 2 for Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Figure 3 for Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Figure 4 for Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Viaarxiv icon

Towards Generating Diverse Audio Captions via Adversarial Training

Add code
Bookmark button
Alert button
Dec 05, 2022
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang

Figure 1 for Towards Generating Diverse Audio Captions via Adversarial Training
Figure 2 for Towards Generating Diverse Audio Captions via Adversarial Training
Figure 3 for Towards Generating Diverse Audio Captions via Adversarial Training
Figure 4 for Towards Generating Diverse Audio Captions via Adversarial Training
Viaarxiv icon

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention

Add code
Bookmark button
Alert button
Oct 28, 2022
Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, Lilian H. Tang, Mark D. Plumbley, Volkan Kılıç, Wenwu Wang

Figure 1 for Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
Figure 2 for Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
Viaarxiv icon

Automated Audio Captioning via Fusion of Low- and High- Dimensional Features

Add code
Bookmark button
Alert button
Oct 10, 2022
Jianyuan Sun, Xubo Liu, Xinhao Mei, Mark D. Plumbley, Volkan Kilic, Wenwu Wang

Figure 1 for Automated Audio Captioning via Fusion of Low- and High- Dimensional Features
Figure 2 for Automated Audio Captioning via Fusion of Low- and High- Dimensional Features
Figure 3 for Automated Audio Captioning via Fusion of Low- and High- Dimensional Features
Viaarxiv icon

On Metric Learning for Audio-Text Cross-Modal Retrieval

Add code
Bookmark button
Alert button
Apr 13, 2022
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang

Figure 1 for On Metric Learning for Audio-Text Cross-Modal Retrieval
Figure 2 for On Metric Learning for Audio-Text Cross-Modal Retrieval
Viaarxiv icon

Leveraging Pre-trained BERT for Audio Captioning

Add code
Bookmark button
Alert button
Mar 27, 2022
Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kılıç, Wenwu Wang

Figure 1 for Leveraging Pre-trained BERT for Audio Captioning
Figure 2 for Leveraging Pre-trained BERT for Audio Captioning
Figure 3 for Leveraging Pre-trained BERT for Audio Captioning
Figure 4 for Leveraging Pre-trained BERT for Audio Captioning
Viaarxiv icon

Deep Neural Decision Forest for Acoustic Scene Classification

Add code
Bookmark button
Alert button
Mar 07, 2022
Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kılıç, Wenwu Wang

Figure 1 for Deep Neural Decision Forest for Acoustic Scene Classification
Figure 2 for Deep Neural Decision Forest for Acoustic Scene Classification
Figure 3 for Deep Neural Decision Forest for Acoustic Scene Classification
Figure 4 for Deep Neural Decision Forest for Acoustic Scene Classification
Viaarxiv icon

Diverse Audio Captioning via Adversarial Training

Add code
Bookmark button
Alert button
Oct 13, 2021
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang

Figure 1 for Diverse Audio Captioning via Adversarial Training
Figure 2 for Diverse Audio Captioning via Adversarial Training
Figure 3 for Diverse Audio Captioning via Adversarial Training
Figure 4 for Diverse Audio Captioning via Adversarial Training
Viaarxiv icon