WAVE: Learning Unified & Versatile Audio-Visual Embeddings with Multimodal LLM

Sep 26, 2025

View paper on arXiv