Peipei Wu

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Dec 14, 2023
Via arXiv

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Oct 23, 2023
Via arXiv

CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing

Oct 11, 2023
Via arXiv

Text-Driven Foley Sound Generation With Latent Diffusion Model

Jun 23, 2023
Via arXiv