Alert button

Automatic Reference-Based Evaluation of Pronoun Translation Misses the Point

Aug 13, 2018
Liane Guillou, Christian Hardmeier

Figure 1 for Automatic Reference-Based Evaluation of Pronoun Translation Misses the Point
Figure 2 for Automatic Reference-Based Evaluation of Pronoun Translation Misses the Point
Figure 3 for Automatic Reference-Based Evaluation of Pronoun Translation Misses the Point
Figure 4 for Automatic Reference-Based Evaluation of Pronoun Translation Misses the Point

Share this with someone who'll enjoy it:

We compare the performance of the APT and AutoPRF metrics for pronoun translation against a manually annotated dataset comprising human judgements as to the correctness of translations of the PROTEST test suite. Although there is some correlation with the human judgements, a range of issues limit the performance of the automated metrics. Instead, we recommend the use of semi-automatic metrics and test suites in place of fully automatic metrics.

* EMNLP 2018  
View paper onarxiv icon

Share this with someone who'll enjoy it: