Maxime Oquab

DINOv3

Aug 13, 2025
Via arXiv

Cluster and Predict Latent Patches for Improved Masked Image Modeling

Feb 12, 2025
Via arXiv

DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment

Dec 20, 2024
Via arXiv

You Don't Need Data-Augmentation in Self-Supervised Learning

Jun 13, 2024
Via arXiv

Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach

May 24, 2024
Via arXiv

Advancing human-centric AI for robust X-ray analysis through holistic self-supervised learning

May 02, 2024
Via arXiv

Vision Transformers Need Registers

Sep 28, 2023
Via arXiv

DINOv2: Learning Robust Visual Features without Supervision

Apr 14, 2023
Via arXiv

Co-training $2^L$ Submodels for Visual Recognition

Dec 09, 2022
Via arXiv

Efficient conditioned face animation using frontally-viewed embedding

Mar 16, 2022
Via arXiv