Christian Federmann

Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies

Jan 12, 2024
Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post

GEMBA-MQM: Detecting Translation Quality Error Spans with GPT-4

Oct 21, 2023
Tom Kocmi, Christian Federmann

Large Language Models Are State-of-the-Art Evaluators of Translation Quality

Feb 28, 2023
Tom Kocmi, Christian Federmann

The JHU-Microsoft Submission for WMT21 Quality Estimation Shared Task

Sep 17, 2021
Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post, Christian Federmann, Philipp Koehn

To Ship or Not to Ship: An Extensive Evaluation of Automatic Metrics for Machine Translation

Jul 22, 2021
Tom Kocmi, Christian Federmann, Roman Grundkiewicz, Marcin Junczys-Dowmunt, Hitokazu Matsushita, Arul Menezes

On User Interfaces for Large-Scale Document-Level Human Evaluation of Machine Translation Outputs

Apr 21, 2021
Roman Grundkiewicz, Marcin Junczys-Dowmunt, Christian Federmann, Tom Kocmi

Achieving Human Parity on Automatic Chinese to English News Translation

Jun 29, 2018
Hany Hassan, Anthony Aue, Chang Chen, Vishal Chowdhary, Jonathan Clark, Christian Federmann, Xuedong Huang, Marcin Junczys-Dowmunt, William Lewis, Mu Li, Shujie Liu, Tie-Yan Liu, Renqian Luo, Arul Menezes, Tao Qin, Frank Seide, Xu Tan, Fei Tian, Lijun Wu, Shuangzhi Wu, Yingce Xia, Dongdong Zhang, Zhirui Zhang, Ming Zhou
