Daniel Deutsch

On the Role of Summary Content Units in Text Summarization Evaluation

Apr 02, 2024
Marcel Nawrath, Agnieszka Nowak, Tristan Ratz, Danilo C. Walenta, Juri Opitz, Leonardo F. R. Ribeiro, João Sedoc, Daniel Deutsch, Simon Mille, Yixin Liu, Lining Zhang, Sebastian Gehrmann, Saad Mahamood, Miruna Clinciu, Khyathi Chandu, Yufang Hou

Via arXiv

Finding Replicable Human Evaluations via Stable Ranking Probability

Apr 01, 2024
Parker Riley, Daniel Deutsch, George Foster, Viresh Ratnakar, Ali Dabirmoghaddam, Markus Freitag

Via arXiv

Pinpoint, Not Criticize: Refining Large Language Models via Fine-Grained Actionable Feedback

Nov 15, 2023
Wenda Xu, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Biao Zhang, Zhongtao Liu, William Yang Wang, Lei Li, Markus Freitag

Via arXiv

There's no Data Like Better Data: Using QE Metrics for MT Data Filtering

Nov 09, 2023
Jan-Thorsten Peter, David Vilar, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Markus Freitag

Via arXiv

The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics

Oct 30, 2023
Christoph Leiter, Juri Opitz, Daniel Deutsch, Yang Gao, Rotem Dror, Steffen Eger

Via arXiv

Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level

Aug 28, 2023
Daniel Deutsch, Juraj Juraska, Mara Finkelstein, Markus Freitag

Via arXiv

The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation

Aug 14, 2023
Patrick Fernandes, Daniel Deutsch, Mara Finkelstein, Parker Riley, André F. T. Martins, Graham Neubig, Ankush Garg, Jonathan H. Clark, Markus Freitag, Orhan Firat

Via arXiv

Ties Matter: Modifying Kendall's Tau for Modern Metric Meta-Evaluation

May 23, 2023
Daniel Deutsch, George Foster, Markus Freitag

Via arXiv

Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization

Dec 28, 2022
Lining Zhang, João Sedoc, Simon Mille, Yufang Hou, Sebastian Gehrmann, Daniel Deutsch, Elizabeth Clark, Yixin Liu, Miruna Clinciu, Saad Mahamood, Khyathi Chandu

Via arXiv