M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition

Jan 22, 2024

View paper on arXiv