Picture for André F. T. Martins

André F. T. Martins

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Add code
Jun 26, 2024
Figure 1 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 2 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 3 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 4 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Viaarxiv icon

QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation

Add code
May 28, 2024
Viaarxiv icon

Can Automatic Metrics Assess High-Quality Translations?

Add code
May 28, 2024
Viaarxiv icon

XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples

Add code
May 08, 2024
Viaarxiv icon

Conformal Prediction for Natural Language Processing: A Survey

Add code
May 03, 2024
Viaarxiv icon

Is Context Helpful for Chat Translation Evaluation?

Add code
Mar 13, 2024
Figure 1 for Is Context Helpful for Chat Translation Evaluation?
Figure 2 for Is Context Helpful for Chat Translation Evaluation?
Figure 3 for Is Context Helpful for Chat Translation Evaluation?
Figure 4 for Is Context Helpful for Chat Translation Evaluation?
Viaarxiv icon

Did Translation Models Get More Robust Without Anyone Even Noticing?

Add code
Mar 06, 2024
Figure 1 for Did Translation Models Get More Robust Without Anyone Even Noticing?
Figure 2 for Did Translation Models Get More Robust Without Anyone Even Noticing?
Figure 3 for Did Translation Models Get More Robust Without Anyone Even Noticing?
Figure 4 for Did Translation Models Get More Robust Without Anyone Even Noticing?
Viaarxiv icon

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

Add code
Feb 27, 2024
Figure 1 for Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Figure 2 for Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Figure 3 for Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Figure 4 for Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Viaarxiv icon

CroissantLLM: A Truly Bilingual French-English Language Model

Add code
Feb 02, 2024
Figure 1 for CroissantLLM: A Truly Bilingual French-English Language Model
Figure 2 for CroissantLLM: A Truly Bilingual French-English Language Model
Figure 3 for CroissantLLM: A Truly Bilingual French-English Language Model
Figure 4 for CroissantLLM: A Truly Bilingual French-English Language Model
Viaarxiv icon

Non-Exchangeable Conformal Language Generation with Nearest Neighbors

Add code
Feb 01, 2024
Figure 1 for Non-Exchangeable Conformal Language Generation with Nearest Neighbors
Figure 2 for Non-Exchangeable Conformal Language Generation with Nearest Neighbors
Figure 3 for Non-Exchangeable Conformal Language Generation with Nearest Neighbors
Figure 4 for Non-Exchangeable Conformal Language Generation with Nearest Neighbors
Viaarxiv icon