Picture for Clayton Fields

Clayton Fields

ESsEN: Training Compact Discriminative Vision-Language Transformers in a Low-Resource Setting

Add code
Apr 20, 2026
Viaarxiv icon

Renaissance: Investigating the Pretraining of Vision-Language Encoders

Add code
Nov 11, 2024
Figure 1 for Renaissance: Investigating the Pretraining of Vision-Language Encoders
Figure 2 for Renaissance: Investigating the Pretraining of Vision-Language Encoders
Figure 3 for Renaissance: Investigating the Pretraining of Vision-Language Encoders
Figure 4 for Renaissance: Investigating the Pretraining of Vision-Language Encoders
Viaarxiv icon

Vision Language Transformers: A Survey

Add code
Jul 06, 2023
Figure 1 for Vision Language Transformers: A Survey
Figure 2 for Vision Language Transformers: A Survey
Figure 3 for Vision Language Transformers: A Survey
Figure 4 for Vision Language Transformers: A Survey
Viaarxiv icon