Picture for Adina Williams

Adina Williams

Meta AI

Changing Answer Order Can Decrease MMLU Accuracy

Add code
Jun 27, 2024
Figure 1 for Changing Answer Order Can Decrease MMLU Accuracy
Figure 2 for Changing Answer Order Can Decrease MMLU Accuracy
Figure 3 for Changing Answer Order Can Decrease MMLU Accuracy
Figure 4 for Changing Answer Order Can Decrease MMLU Accuracy
Viaarxiv icon

Decomposed evaluations of geographic disparities in text-to-image models

Add code
Jun 17, 2024
Figure 1 for Decomposed evaluations of geographic disparities in text-to-image models
Figure 2 for Decomposed evaluations of geographic disparities in text-to-image models
Figure 3 for Decomposed evaluations of geographic disparities in text-to-image models
Figure 4 for Decomposed evaluations of geographic disparities in text-to-image models
Viaarxiv icon

The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More

Add code
Jun 07, 2024
Viaarxiv icon

Towards Geographic Inclusion in the Evaluation of Text-to-Image Models

Add code
May 07, 2024
Viaarxiv icon

The PRISM Alignment Project: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models

Add code
Apr 24, 2024
Viaarxiv icon

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Add code
Apr 18, 2024
Figure 1 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 2 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 3 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 4 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Viaarxiv icon

[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

Add code
Apr 09, 2024
Figure 1 for [Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Viaarxiv icon

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Add code
Mar 26, 2024
Figure 1 for Improving Text-to-Image Consistency via Automatic Prompt Optimization
Figure 2 for Improving Text-to-Image Consistency via Automatic Prompt Optimization
Figure 3 for Improving Text-to-Image Consistency via Automatic Prompt Optimization
Figure 4 for Improving Text-to-Image Consistency via Automatic Prompt Optimization
Viaarxiv icon

Compositional learning of functions in humans and machines

Add code
Mar 18, 2024
Figure 1 for Compositional learning of functions in humans and machines
Figure 2 for Compositional learning of functions in humans and machines
Figure 3 for Compositional learning of functions in humans and machines
Figure 4 for Compositional learning of functions in humans and machines
Viaarxiv icon

EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models

Add code
Dec 21, 2023
Figure 1 for EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models
Figure 2 for EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models
Figure 3 for EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models
Figure 4 for EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models
Viaarxiv icon