Faisal Ladhak

STORYSUMM: Evaluating Faithfulness in Story Summarization

Jul 09, 2024
Via arXiv

Aligning Large Language Models via Fine-grained Supervision

Jun 04, 2024
Via arXiv

Proving Test Set Contamination in Black Box Language Models

Oct 26, 2023
Via arXiv

From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Sep 08, 2023
Via arXiv

Generating EDU Extracts for Plan-Guided Summary Re-Ranking

May 28, 2023
Via arXiv

Whose Opinions Do Language Models Reflect?

Mar 30, 2023
Via arXiv

Benchmarking Large Language Models for News Summarization

Jan 31, 2023
Via arXiv

Tracing and Removing Data Errors in Natural Language Generation Datasets

Dec 21, 2022
Via arXiv

Evaluating Human-Language Model Interaction

Dec 20, 2022
Via arXiv

Holistic Evaluation of Language Models

Nov 16, 2022
Via arXiv