Alert button

UAVM: A Unified Model for Audio-Visual Learning

Jul 29, 2022
Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James Glass

Figure 1 for UAVM: A Unified Model for Audio-Visual Learning
Figure 2 for UAVM: A Unified Model for Audio-Visual Learning
Figure 3 for UAVM: A Unified Model for Audio-Visual Learning
Figure 4 for UAVM: A Unified Model for Audio-Visual Learning

Share this with someone who'll enjoy it:

Conventional audio-visual models have independent audio and video branches. We design a unified model for audio and video processing called Unified Audio-Visual Model (UAVM). In this paper, we describe UAVM, report its new state-of-the-art audio-visual event classification accuracy of 65.8% on VGGSound, and describe the intriguing properties of the model.

View paper onarxiv icon

Share this with someone who'll enjoy it: