Alert button

VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts

Dec 04, 2021
Renrui Zhang, Longtian Qiu, Wei Zhang, Ziyao Zeng

Figure 1 for VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Figure 2 for VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Figure 3 for VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Figure 4 for VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: