Alert button
Picture for Philip J. B. Jackson

Philip J. B. Jackson

Alert button

Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization

Add code
Bookmark button
Alert button
Dec 21, 2023
Davide Berghi, Philip J. B. Jackson

Viaarxiv icon

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Add code
Bookmark button
Alert button
Dec 14, 2023
Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson

Figure 1 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Figure 2 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Figure 3 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Viaarxiv icon

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Add code
Bookmark button
Alert button
Oct 23, 2023
Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang

Figure 1 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 2 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 3 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 4 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Viaarxiv icon

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection

Add code
Bookmark button
Alert button
Aug 09, 2023
Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton

Figure 1 for PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
Figure 2 for PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
Figure 3 for PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
Figure 4 for PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
Viaarxiv icon

Audio Inputs for Active Speaker Detection and Localization via Microphone Array

Add code
Bookmark button
Alert button
Jul 27, 2023
Davide Berghi, Philip J. B. Jackson

Figure 1 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 2 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 3 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 4 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Viaarxiv icon

Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research

Add code
Bookmark button
Alert button
Dec 04, 2022
Davide Berghi, Marco Volino, Philip J. B. Jackson

Figure 1 for Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research
Figure 2 for Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research
Figure 3 for Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research
Figure 4 for Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research
Viaarxiv icon

Visually Supervised Speaker Detection and Localization via Microphone Array

Add code
Bookmark button
Alert button
Mar 07, 2022
Davide Berghi, Adrian Hilton, Philip J. B. Jackson

Figure 1 for Visually Supervised Speaker Detection and Localization via Microphone Array
Figure 2 for Visually Supervised Speaker Detection and Localization via Microphone Array
Figure 3 for Visually Supervised Speaker Detection and Localization via Microphone Array
Figure 4 for Visually Supervised Speaker Detection and Localization via Microphone Array
Viaarxiv icon

Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction

Add code
Bookmark button
Alert button
May 03, 2021
Hanne Stenzel, Davide Berghi, Marco Volino, Philip J. B. Jackson

Figure 1 for Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction
Figure 2 for Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction
Viaarxiv icon

Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks

Add code
Bookmark button
Alert button
Jun 14, 2019
Qiuqiang Kong, Yong Xu, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley

Figure 1 for Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
Figure 2 for Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
Figure 3 for Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
Figure 4 for Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
Viaarxiv icon

Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging

Add code
Bookmark button
Alert button
Nov 29, 2016
Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley

Figure 1 for Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Figure 2 for Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Figure 3 for Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Figure 4 for Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Viaarxiv icon