Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Darío Garigliotti

Relative Drawing Identification Complexity is Invariant to Modality in Vision-Language Models

May 14, 2025

Diogo Freitas, Brigt Håvardstun, Cèsar Ferri, Darío Garigliotti, Jan Arne Telle, José Hernández-Orallo

Abstract:Large language models have become multimodal, and many of them are said to integrate their modalities using common representations. If this were true, a drawing of a car as an image, for instance, should map to the similar area in the latent space as a textual description of the strokes that conform the drawing. To explore this in a black-box access regime to these models, we propose the use of machine teaching, a theory that studies the minimal set of examples a teacher needs to choose so that the learner captures the concept. In this paper we evaluate the complexity of teaching visual-language models a subset of objects in the Quick, Draw! dataset using two presentations: raw images as bitmaps and trace coordinates in TikZ format. The results indicate that image-based representations generally require fewer segments and achieve higher accuracy than coordinate-based representations. But, surprisingly, the teaching size usually ranks concepts similarly across both modalities, even when controlling for (a human proxy of) concept priors, suggesting that the simplicity of concepts may be an inherent property that transcends modality representations.

* 54 pages (42 pages of appendix)

Via

Access Paper or Ask Questions

Semi-supervised Learning for Word Sense Disambiguation

Aug 26, 2019

Darío Garigliotti

Figure 1 for Semi-supervised Learning for Word Sense Disambiguation

Figure 2 for Semi-supervised Learning for Word Sense Disambiguation

Figure 3 for Semi-supervised Learning for Word Sense Disambiguation

Abstract:This work is a study of the impact of multiple aspects in a classic unsupervised word sense disambiguation algorithm. We identify relevant factors in a decision rule algorithm, including the initial labeling of examples, the formalization of the rule confidence, and the criteria for accepting a decision rule. Some of these factors are only implicitly considered in the original literature. We then propose a lightly supervised version of the algorithm, and employ a pseudo-word-based strategy to evaluate the impact of these factors. The obtained performances are comparable with those of highly optimized formulations of the word sense disambiguation method.

* This work was awarded the Third Place in the EST 2013 Contest (ISSN 1850-2946) at the 42nd JAIIO (Annals of 42nd JAIIO - Argentine Journals of Informatics - ISSN 1850-2776)

Via

Access Paper or Ask Questions

Unsupervised Context Retrieval for Long-tail Entities

Aug 05, 2019

Darío Garigliotti, Dyaa Albakour, Miguel Martinez, Krisztian Balog

Figure 1 for Unsupervised Context Retrieval for Long-tail Entities

Figure 2 for Unsupervised Context Retrieval for Long-tail Entities

Figure 3 for Unsupervised Context Retrieval for Long-tail Entities

Abstract:Monitoring entities in media streams often relies on rich entity representations, like structured information available in a knowledge base (KB). For long-tail entities, such monitoring is highly challenging, due to their limited, if not entirely missing, representation in the reference KB. In this paper, we address the problem of retrieving textual contexts for monitoring long-tail entities. We propose an unsupervised method to overcome the limited representation of long-tail entities by leveraging established entities and their contexts as support information. Evaluation on a purpose-built test collection shows the suitability of our approach and its robustness for out-of-KB entities.

* Proceedings of the 2019 ACM International Conference on Theory of Information Retrieval (ICTIR' 19)

Via

Access Paper or Ask Questions

NeuType: A Simple and Effective Neural Network Approach for Predicting Missing Entity Type Information in Knowledge Bases

Jul 05, 2019

Jon Arne Bø Hovda, Darío Garigliotti, Krisztian Balog

Figure 1 for NeuType: A Simple and Effective Neural Network Approach for Predicting Missing Entity Type Information in Knowledge Bases

Figure 2 for NeuType: A Simple and Effective Neural Network Approach for Predicting Missing Entity Type Information in Knowledge Bases

Abstract:Knowledge bases store information about the semantic types of entities, which can be utilized in a range of information access tasks. This information, however, is often incomplete, due to new entities emerging on a daily basis. We address the task of automatically assigning types to entities in a knowledge base from a type taxonomy. Specifically, we present two neural network architectures, which take short entity descriptions and, optionally, information about related entities as input. Using the DBpedia knowledge base for experimental evaluation, we demonstrate that these simple architectures yield significant improvements over the current state of the art.

Via

Access Paper or Ask Questions

IntentsKB: A Knowledge Base of Entity-Oriented Search Intents

Sep 02, 2018

Darío Garigliotti, Krisztian Balog

Figure 1 for IntentsKB: A Knowledge Base of Entity-Oriented Search Intents

Figure 2 for IntentsKB: A Knowledge Base of Entity-Oriented Search Intents

Figure 3 for IntentsKB: A Knowledge Base of Entity-Oriented Search Intents

Figure 4 for IntentsKB: A Knowledge Base of Entity-Oriented Search Intents

Abstract:We address the problem of constructing a knowledge base of entity-oriented search intents. Search intents are defined on the level of entity types, each comprising of a high-level intent category (property, website, service, or other), along with a cluster of query terms used to express that intent. These machine-readable statements can be leveraged in various applications, e.g., for generating entity cards or query recommendations. By structuring service-oriented search intents, we take one step towards making entities actionable. The main contribution of this paper is a pipeline of components we develop to construct a knowledge base of entity intents. We evaluate performance both component-wise and end-to-end, and demonstrate that our approach is able to generate high-quality data.

* Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM'18), 2018. 4 pages. 2 figures

Via

Access Paper or Ask Questions

Towards an Understanding of Entity-Oriented Search Intents

Feb 22, 2018

Darío Garigliotti, Krisztian Balog

Figure 1 for Towards an Understanding of Entity-Oriented Search Intents

Figure 2 for Towards an Understanding of Entity-Oriented Search Intents

Abstract:Entity-oriented search deals with a wide variety of information needs, from displaying direct answers to interacting with services. In this work, we aim to understand what are prominent entity-oriented search intents and how they can be fulfilled. We develop a scheme of entity intent categories, and use them to annotate a sample of queries. Specifically, we annotate unique query refiners on the level of entity types. We observe that, on average, over half of those refiners seek to interact with a service, while over a quarter of the refiners search for information that may be looked up in a knowledge base.

* Advances in Information Retrieval. Proceedings of the 40th European Conference on Information Retrieval (ECIR '18), 2018

Via

Access Paper or Ask Questions

Generating High-Quality Query Suggestion Candidates for Task-Based Search

Feb 22, 2018

Heng Ding, Shuo Zhang, Darío Garigliotti, Krisztian Balog

Figure 1 for Generating High-Quality Query Suggestion Candidates for Task-Based Search

Figure 2 for Generating High-Quality Query Suggestion Candidates for Task-Based Search

Abstract:We address the task of generating query suggestions for task-based search. The current state of the art relies heavily on suggestions provided by a major search engine. In this paper, we solve the task without reliance on search engines. Specifically, we focus on the first step of a two-stage pipeline approach, which is dedicated to the generation of query suggestion candidates. We present three methods for generating candidate suggestions and apply them on multiple information sources. Using a purpose-built test collection, we find that these methods are able to generate high-quality suggestion candidates.

* Advances in Information Retrieval. Proceedings of the 40th European Conference on Information Retrieval (ECIR '18), 2018

Via

Access Paper or Ask Questions

On Type-Aware Entity Retrieval

Aug 28, 2017

Darío Garigliotti, Krisztian Balog

Figure 1 for On Type-Aware Entity Retrieval

Figure 2 for On Type-Aware Entity Retrieval

Figure 3 for On Type-Aware Entity Retrieval

Figure 4 for On Type-Aware Entity Retrieval

Abstract:Today, the practice of returning entities from a knowledge base in response to search queries has become widespread. One of the distinctive characteristics of entities is that they are typed, i.e., assigned to some hierarchically organized type system (type taxonomy). The primary objective of this paper is to gain a better understanding of how entity type information can be utilized in entity retrieval. We perform this investigation in an idealized "oracle" setting, assuming that we know the distribution of target types of the relevant entities for a given query. We perform a thorough analysis of three main aspects: (i) the choice of type taxonomy, (ii) the representation of hierarchical type information, and (iii) the combination of type-based and term-based similarity in the retrieval model. Using a standard entity search test collection based on DBpedia, we find that type information proves most useful when using large type taxonomies that provide very specific types. We provide further insights on the extensional coverage of entities and on the utility of target types.

* Proceedings of the 3rd ACM International Conference on the Theory of Information Retrieval (ICTIR '17), 2017

Via

Access Paper or Ask Questions

Generating Query Suggestions to Support Task-Based Search

Aug 28, 2017

Darío Garigliotti, Krisztian Balog

Figure 1 for Generating Query Suggestions to Support Task-Based Search

Figure 2 for Generating Query Suggestions to Support Task-Based Search

Figure 3 for Generating Query Suggestions to Support Task-Based Search

Figure 4 for Generating Query Suggestions to Support Task-Based Search

Abstract:We address the problem of generating query suggestions to support users in completing their underlying tasks (which motivated them to search in the first place). Given an initial query, these query suggestions should provide a coverage of possible subtasks the user might be looking for. We propose a probabilistic modeling framework that obtains keyphrases from multiple sources and generates query suggestions from these keyphrases. Using the test suites of the TREC Tasks track, we evaluate and analyze each component of our model.

* Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17), 2017

Via

Access Paper or Ask Questions