Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Cory Paik

The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Oct 15, 2021

Cory Paik, Stéphane Aroca-Ouellette, Alessandro Roncone, Katharina Kann

Figure 1 for The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Figure 2 for The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Figure 3 for The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Figure 4 for The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Abstract:Recent work has raised concerns about the inherent limitations of text-only pretraining. In this paper, we first demonstrate that reporting bias, the tendency of people to not state the obvious, is one of the causes of this limitation, and then investigate to what extent multimodal training can mitigate this issue. To accomplish this, we 1) generate the Color Dataset (CoDa), a dataset of human-perceived color distributions for 521 common objects; 2) use CoDa to analyze and compare the color distribution found in text, the distribution captured by language models, and a human's perception of color; and 3) investigate the performance differences between text-only and multimodal models on CoDa. Our results show that the distribution of colors that a language model recovers correlates more strongly with the inaccurate distribution found in text than with the ground-truth, supporting the claim that reporting bias negatively impacts and inherently limits text-only training. We then demonstrate that multimodal models can leverage their visual training to mitigate these effects, providing a promising avenue for future research.

* Accepted to EMNLP 2021, 9 Pages

Via

Access Paper or Ask Questions

PROST: Physical Reasoning of Objects through Space and Time

Jun 07, 2021

Stéphane Aroca-Ouellette, Cory Paik, Alessandro Roncone, Katharina Kann

Abstract:We present a new probing dataset named PROST: Physical Reasoning about Objects Through Space and Time. This dataset contains 18,736 multiple-choice questions made from 14 manually curated templates, covering 10 physical reasoning concepts. All questions are designed to probe both causal and masked language models in a zero-shot setting. We conduct an extensive analysis which demonstrates that state-of-the-art pretrained models are inadequate at physical reasoning: they are influenced by the order in which answer options are presented to them, they struggle when the superlative in a question is inverted (e.g., most <-> least), and increasing the amount of pretraining data and parameters only yields minimal improvements. These results provide support for the hypothesis that current pretrained models' ability to reason about physical interactions is inherently limited by a lack of real world experience. By highlighting these limitations, we hope to motivate the development of models with a human-like understanding of the physical world.

* Accepted to ACL-Findings 2021, 9 Pages

Via

Access Paper or Ask Questions