Picture for Ludwig Schmidt

Ludwig Schmidt

Shammie

Multilingual Diversity Improves Vision-Language Representations

Add code
May 27, 2024
Figure 1 for Multilingual Diversity Improves Vision-Language Representations
Figure 2 for Multilingual Diversity Improves Vision-Language Representations
Figure 3 for Multilingual Diversity Improves Vision-Language Representations
Figure 4 for Multilingual Diversity Improves Vision-Language Representations
Viaarxiv icon

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Add code
Apr 01, 2024
Viaarxiv icon

Do CLIPs Always Generalize Better than ImageNet Models?

Add code
Mar 18, 2024
Figure 1 for Do CLIPs Always Generalize Better than ImageNet Models?
Figure 2 for Do CLIPs Always Generalize Better than ImageNet Models?
Figure 3 for Do CLIPs Always Generalize Better than ImageNet Models?
Figure 4 for Do CLIPs Always Generalize Better than ImageNet Models?
Viaarxiv icon

Language models scale reliably with over-training and on downstream tasks

Add code
Mar 13, 2024
Figure 1 for Language models scale reliably with over-training and on downstream tasks
Figure 2 for Language models scale reliably with over-training and on downstream tasks
Figure 3 for Language models scale reliably with over-training and on downstream tasks
Figure 4 for Language models scale reliably with over-training and on downstream tasks
Viaarxiv icon

Benchmarking Distribution Shift in Tabular Data with TableShift

Add code
Dec 14, 2023
Viaarxiv icon

GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment

Add code
Oct 17, 2023
Figure 1 for GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Figure 2 for GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Figure 3 for GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Figure 4 for GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Viaarxiv icon

Data Filtering Networks

Add code
Oct 02, 2023
Figure 1 for Data Filtering Networks
Figure 2 for Data Filtering Networks
Figure 3 for Data Filtering Networks
Figure 4 for Data Filtering Networks
Viaarxiv icon

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

Add code
Aug 07, 2023
Figure 1 for OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
Figure 2 for OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
Figure 3 for OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
Figure 4 for OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
Viaarxiv icon

On the Connection between Pre-training Data Diversity and Fine-tuning Robustness

Add code
Jul 24, 2023
Figure 1 for On the Connection between Pre-training Data Diversity and Fine-tuning Robustness
Figure 2 for On the Connection between Pre-training Data Diversity and Fine-tuning Robustness
Figure 3 for On the Connection between Pre-training Data Diversity and Fine-tuning Robustness
Figure 4 for On the Connection between Pre-training Data Diversity and Fine-tuning Robustness
Viaarxiv icon

Improving Multimodal Datasets with Image Captioning

Add code
Jul 19, 2023
Figure 1 for Improving Multimodal Datasets with Image Captioning
Figure 2 for Improving Multimodal Datasets with Image Captioning
Figure 3 for Improving Multimodal Datasets with Image Captioning
Figure 4 for Improving Multimodal Datasets with Image Captioning
Viaarxiv icon