Davide Berghi

Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization

Dec 21, 2023
Davide Berghi, Philip J. B. Jackson

Via arXiv

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Dec 14, 2023
Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson

Via arXiv

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Oct 23, 2023
Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang

Via arXiv

Audio Inputs for Active Speaker Detection and Localization via Microphone Array

Jul 27, 2023
Davide Berghi, Philip J. B. Jackson

Via arXiv

Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research

Dec 04, 2022
Davide Berghi, Marco Volino, Philip J. B. Jackson

Via arXiv

Visually Supervised Speaker Detection and Localization via Microphone Array

Mar 07, 2022
Davide Berghi, Adrian Hilton, Philip J. B. Jackson

Via arXiv

Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction

May 03, 2021
Hanne Stenzel, Davide Berghi, Marco Volino, Philip J. B. Jackson

Via arXiv