Picture for Darina Koishigarina

Darina Koishigarina

CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally

Add code
Feb 05, 2025
Figure 1 for CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally
Figure 2 for CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally
Figure 3 for CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally
Figure 4 for CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally
Viaarxiv icon