Kawin Ethayarajh

Data Checklist: On Unit-Testing Datasets with Usable Information

Aug 06, 2024

KTO: Model Alignment as Prospect Theoretic Optimization

Feb 02, 2024

Anchor Points: Benchmarking Models with Much Fewer Examples

Sep 14, 2023

How Human is Human Evaluation? Improving the Gold Standard for NLG with Utility Theory

May 24, 2022

Richer Countries and Richer Representations

May 10, 2022

Problems with Cosine as a Measure of Embedding Similarity for High Frequency Words

May 10, 2022

Information-Theoretic Measures of Dataset Difficulty

Oct 16, 2021

Conditional probing: measuring usable information beyond a baseline

Sep 19, 2021

On the Opportunities and Risks of Foundation Models

Aug 18, 2021

Attention Flows are Shapley Value Explanations

May 31, 2021