Picture for Alaaeldin El-Nouby

Alaaeldin El-Nouby

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Add code
Apr 10, 2025
Figure 1 for Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models
Figure 2 for Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models
Figure 3 for Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models
Figure 4 for Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models
Viaarxiv icon

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Add code
Feb 19, 2025
Viaarxiv icon

Multimodal Autoregressive Pre-training of Large Vision Encoders

Add code
Nov 21, 2024
Figure 1 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 2 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 3 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 4 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Viaarxiv icon

DataComp-LM: In search of the next generation of training sets for language models

Add code
Jun 18, 2024
Figure 1 for DataComp-LM: In search of the next generation of training sets for language models
Figure 2 for DataComp-LM: In search of the next generation of training sets for language models
Figure 3 for DataComp-LM: In search of the next generation of training sets for language models
Figure 4 for DataComp-LM: In search of the next generation of training sets for language models
Viaarxiv icon

Scalable Pre-training of Large Autoregressive Image Models

Add code
Jan 16, 2024
Figure 1 for Scalable Pre-training of Large Autoregressive Image Models
Figure 2 for Scalable Pre-training of Large Autoregressive Image Models
Figure 3 for Scalable Pre-training of Large Autoregressive Image Models
Figure 4 for Scalable Pre-training of Large Autoregressive Image Models
Viaarxiv icon

ImageBind: One Embedding Space To Bind Them All

Add code
May 09, 2023
Viaarxiv icon

DINOv2: Learning Robust Visual Features without Supervision

Add code
Apr 14, 2023
Figure 1 for DINOv2: Learning Robust Visual Features without Supervision
Figure 2 for DINOv2: Learning Robust Visual Features without Supervision
Figure 3 for DINOv2: Learning Robust Visual Features without Supervision
Figure 4 for DINOv2: Learning Robust Visual Features without Supervision
Viaarxiv icon

Are Visual Recognition Models Robust to Image Compression?

Add code
Apr 10, 2023
Figure 1 for Are Visual Recognition Models Robust to Image Compression?
Figure 2 for Are Visual Recognition Models Robust to Image Compression?
Figure 3 for Are Visual Recognition Models Robust to Image Compression?
Viaarxiv icon

Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models

Add code
Jan 28, 2023
Figure 1 for Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models
Figure 2 for Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models
Figure 3 for Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models
Figure 4 for Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models
Viaarxiv icon

Image Compression with Product Quantized Masked Image Modeling

Add code
Dec 14, 2022
Figure 1 for Image Compression with Product Quantized Masked Image Modeling
Figure 2 for Image Compression with Product Quantized Masked Image Modeling
Figure 3 for Image Compression with Product Quantized Masked Image Modeling
Figure 4 for Image Compression with Product Quantized Masked Image Modeling
Viaarxiv icon