Markus Freitag

Finding Replicable Human Evaluations via Stable Ranking Probability

Apr 01, 2024
Parker Riley, Daniel Deutsch, George Foster, Viresh Ratnakar, Ali Dabirmoghaddam, Markus Freitag

Via arXiv

Pinpoint, Not Criticize: Refining Large Language Models via Fine-Grained Actionable Feedback

Nov 15, 2023
Wenda Xu, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Biao Zhang, Zhongtao Liu, William Yang Wang, Lei Li, Markus Freitag

Via arXiv

There's no Data Like Better Data: Using QE Metrics for MT Data Filtering

Nov 09, 2023
Jan-Thorsten Peter, David Vilar, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Markus Freitag

Via arXiv

Quality Control at Your Fingertips: Quality-Aware Translation Models

Oct 10, 2023
Christian Tomani, David Vilar, Markus Freitag, Colin Cherry, Subhajit Naskar, Mara Finkelstein, Daniel Cremers

Via arXiv

MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods

Sep 28, 2023
Mara Finkelstein, Markus Freitag

Via arXiv

Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level

Aug 28, 2023
Daniel Deutsch, Juraj Juraska, Mara Finkelstein, Markus Freitag

Via arXiv

The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation

Aug 14, 2023
Patrick Fernandes, Daniel Deutsch, Mara Finkelstein, Parker Riley, André F. T. Martins, Graham Neubig, Ankush Garg, Jonathan H. Clark, Markus Freitag, Orhan Firat

Via arXiv

Ties Matter: Modifying Kendall's Tau for Modern Metric Meta-Evaluation

May 23, 2023
Daniel Deutsch, George Foster, Markus Freitag

Via arXiv

INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback

May 23, 2023
Wenda Xu, Danqing Wang, Liangming Pan, Zhenqiao Song, Markus Freitag, William Yang Wang, Lei Li

Via arXiv