Markus Freitag

Searching for Difficult-to-Translate Test Examples at Scale

Sep 30, 2025

Deconstructing Self-Bias in LLM-generated Translation Benchmarks

Sep 30, 2025

Generating Difficult-to-Translate Texts

Sep 30, 2025

You Cannot Feed Two Birds with One Score: the Accuracy-Naturalness Tradeoff in Translation

Apr 01, 2025

Enhancing Human Evaluation in Machine Translation with Comparative Judgment

Feb 25, 2025

WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects

Feb 18, 2025

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation

Jan 30, 2025

From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set

Nov 23, 2024

Mitigating Metric Bias in Minimum Bayes Risk Decoding

Nov 05, 2024

Learning from others' mistakes: Finetuning machine translation models with span-level error annotations

Oct 21, 2024