Picture for Emily Reif

Emily Reif

Who's asking? User personas and the mechanics of latent misalignment

Add code
Jun 17, 2024
Figure 1 for Who's asking? User personas and the mechanics of latent misalignment
Figure 2 for Who's asking? User personas and the mechanics of latent misalignment
Figure 3 for Who's asking? User personas and the mechanics of latent misalignment
Figure 4 for Who's asking? User personas and the mechanics of latent misalignment
Viaarxiv icon

Automatic Histograms: Leveraging Language Models for Text Dataset Exploration

Add code
Feb 21, 2024
Figure 1 for Automatic Histograms: Leveraging Language Models for Text Dataset Exploration
Figure 2 for Automatic Histograms: Leveraging Language Models for Text Dataset Exploration
Figure 3 for Automatic Histograms: Leveraging Language Models for Text Dataset Exploration
Figure 4 for Automatic Histograms: Leveraging Language Models for Text Dataset Exploration
Viaarxiv icon

Understanding the Dataset Practitioners Behind Large Language Model Development

Add code
Feb 21, 2024
Figure 1 for Understanding the Dataset Practitioners Behind Large Language Model Development
Figure 2 for Understanding the Dataset Practitioners Behind Large Language Model Development
Viaarxiv icon

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

Add code
Feb 16, 2024
Viaarxiv icon

SoUnD Framework: Analyzing (So)cial Representation in (Un)structured (D)ata

Add code
Dec 01, 2023
Viaarxiv icon

Data Similarity is Not Enough to Explain Language Model Performance

Add code
Nov 15, 2023
Figure 1 for Data Similarity is Not Enough to Explain Language Model Performance
Figure 2 for Data Similarity is Not Enough to Explain Language Model Performance
Figure 3 for Data Similarity is Not Enough to Explain Language Model Performance
Figure 4 for Data Similarity is Not Enough to Explain Language Model Performance
Viaarxiv icon

A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

Add code
May 22, 2023
Figure 1 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 2 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 3 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 4 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Viaarxiv icon

Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models

Add code
May 19, 2023
Figure 1 for Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models
Figure 2 for Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models
Figure 3 for Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models
Figure 4 for Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models
Viaarxiv icon

PaLM 2 Technical Report

Add code
May 17, 2023
Figure 1 for PaLM 2 Technical Report
Figure 2 for PaLM 2 Technical Report
Figure 3 for PaLM 2 Technical Report
Figure 4 for PaLM 2 Technical Report
Viaarxiv icon

The Case for a Single Model that can Both Generate Continuations and Fill in the Blank

Add code
Jun 09, 2022
Figure 1 for The Case for a Single Model that can Both Generate Continuations and Fill in the Blank
Figure 2 for The Case for a Single Model that can Both Generate Continuations and Fill in the Blank
Figure 3 for The Case for a Single Model that can Both Generate Continuations and Fill in the Blank
Figure 4 for The Case for a Single Model that can Both Generate Continuations and Fill in the Blank
Viaarxiv icon