Picture for Piotr Bojanowski

Piotr Bojanowski

WILLOW, LIENS

Who Needs Labels? Adapting Vision Foundation Models With the Metadata You Already Have

Add code
Jun 03, 2026
Viaarxiv icon

Misalignment Between Backpropagation and the Hierarchy of Brain Responses to Images

Add code
May 27, 2026
Viaarxiv icon

VGGT-$Ω$

Add code
May 14, 2026
Viaarxiv icon

Efficient Universal Perception Encoder

Add code
Mar 23, 2026
Viaarxiv icon

Revisiting [CLS] and Patch Token Interaction in Vision Transformers

Add code
Feb 09, 2026
Viaarxiv icon

Disentangling the Factors of Convergence between Brains and Computer Vision Models

Add code
Aug 25, 2025
Viaarxiv icon

DINOv3

Add code
Aug 13, 2025
Viaarxiv icon

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

Add code
Jun 11, 2025
Viaarxiv icon

Cluster and Predict Latents Patches for Improved Masked Image Modeling

Add code
Feb 12, 2025
Viaarxiv icon

DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment

Add code
Dec 20, 2024
Figure 1 for DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Figure 2 for DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Figure 3 for DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Figure 4 for DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Viaarxiv icon