Alert button

VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment

Add code
Bookmark button
Alert button
Oct 09, 2022
Shraman Pramanick, Li Jing, Sayan Nag, Jiachen Zhu, Hardik Shah, Yann LeCun, Rama Chellappa

Figure 1 for VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment
Figure 2 for VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment
Figure 3 for VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment
Figure 4 for VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: