Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Oct 07, 2020

Wanrong Zhu, Xin Eric Wang, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang

Figure 1 for Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Figure 2 for Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Figure 3 for Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Figure 4 for Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Share this with someone who'll enjoy it:

Abstract:A major challenge in visually grounded language generation is to build robust benchmark datasets and models that can generalize well in real-world settings. To do this, it is critical to ensure that our evaluation protocols are correct, and benchmarks are reliable. In this work, we set forth to design a set of experiments to understand an important but often ignored problem in visually grounded language generation: given that humans have different utilities and visual attention, how will the sample variance in multi-reference datasets affect the models' performance? Empirically, we study several multi-reference datasets and corresponding vision-and-language tasks. We show that it is of paramount importance to report variance in experiments; that human-generated references could vary drastically in different datasets/tasks, revealing the nature of each task; that metric-wise, CIDEr has shown systematically larger variances than others. Our evaluations on reference-per-instance shed light on the design of reliable datasets in the future.

* EMNLP 2020

View paper on

Share this with someone who'll enjoy it:

Title:Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Paper and Code