Tom Kocmi

Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets

Jan 29, 2024
Nikita Moghe, Arnisa Fazla, Chantal Amrhein, Tom Kocmi, Mark Steedman, Alexandra Birch, Rico Sennrich, Liane Guillou

Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies

Jan 12, 2024
Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post

GEMBA-MQM: Detecting Translation Quality Error Spans with GPT-4

Oct 21, 2023
Tom Kocmi, Christian Federmann

SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window

Sep 16, 2023
Vikas Raunak, Tom Kocmi, Matt Post

Large Language Models Are State-of-the-Art Evaluators of Translation Quality

Feb 28, 2023
Tom Kocmi, Christian Federmann

Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference

Jan 28, 2023
Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi, Yuchen Eleanor Jiang, Mrinmaya Sachan

The Reality of Multi-Lingual Machine Translation

Feb 25, 2022
Tom Kocmi, Dominik Macháček, Ondřej Bojar

To Ship or Not to Ship: An Extensive Evaluation of Automatic Metrics for Machine Translation

Jul 22, 2021
Tom Kocmi, Christian Federmann, Roman Grundkiewicz, Marcin Junczys-Dowmunt, Hitokazu Matsushita, Arul Menezes

On User Interfaces for Large-Scale Document-Level Human Evaluation of Machine Translation Outputs

Apr 21, 2021
Roman Grundkiewicz, Marcin Junczys-Dowmunt, Christian Federmann, Tom Kocmi
