Piotr Bojanowski

WILLOW, LIENS

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

Jun 11, 2025

Cluster and Predict Latent Patches for Improved Masked Image Modeling

Feb 12, 2025

DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment

Dec 20, 2024

You Don't Need Data-Augmentation in Self-Supervised Learning

Jun 13, 2024

Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach

May 24, 2024

Advancing human-centric AI for robust X-ray analysis through holistic self-supervised learning

May 02, 2024

Vision Transformers Need Registers

Sep 28, 2023

Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions

Apr 18, 2023

Sub-meter resolution canopy height maps using self-supervised learning and a vision transformer trained on Aerial and GEDI Lidar

Apr 17, 2023

DINOv2: Learning Robust Visual Features without Supervision

Apr 14, 2023