Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Do You See What I Mean? Visual Resolution of Linguistic Ambiguities

Mar 26, 2016

Yevgeni Berzak, Andrei Barbu, Daniel Harari, Boris Katz, Shimon Ullman

Figure 1 for Do You See What I Mean? Visual Resolution of Linguistic Ambiguities

Figure 2 for Do You See What I Mean? Visual Resolution of Linguistic Ambiguities

Figure 3 for Do You See What I Mean? Visual Resolution of Linguistic Ambiguities

Figure 4 for Do You See What I Mean? Visual Resolution of Linguistic Ambiguities

Share this with someone who'll enjoy it:

Abstract:Understanding language goes hand in hand with the ability to integrate complex contextual information obtained via perception. In this work, we present a novel task for grounded language understanding: disambiguating a sentence given a visual scene which depicts one of the possible interpretations of that sentence. To this end, we introduce a new multimodal corpus containing ambiguous sentences, representing a wide range of syntactic, semantic and discourse ambiguities, coupled with videos that visualize the different interpretations for each sentence. We address this task by extending a vision model which determines if a sentence is depicted by a video. We demonstrate how such a model can be adjusted to recognize different interpretations of the same underlying sentence, allowing to disambiguate sentences in a unified fashion across the different ambiguity types.

* Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015, pages 1477--1487 * EMNLP 2015

View paper on

Share this with someone who'll enjoy it:

Title:Do You See What I Mean? Visual Resolution of Linguistic Ambiguities

Paper and Code