Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jens Lehmann

MDE: Multi Distance Embeddings for Link Prediction in Knowledge Graphs

Jun 10, 2019

Afshin Sadeghi, Damien Graux, Jens Lehmann

Figure 1 for MDE: Multi Distance Embeddings for Link Prediction in Knowledge Graphs

Abstract:Over the past decade, knowledge graphs became popular for capturing structured domain knowledge. Relational learning models enable the prediction of missing links inside knowledge graphs. More specifically, latent distance approaches model the relationships among entities via a distance between latent representations. Translating embedding models (e.g., TransE) are among the most popular latent distance approaches which use one distance function to learn multiple relation patterns. However, they are not capable of capturing symmetric relations. They also force relations with reflexive patterns to become symmetric and transitive. In order to improve distance based embedding, we propose multi-distance embeddings (MDE). Our solution is based on the idea that by learning independent embedding vectors for each entity and relation one can aggregate contrasting distance functions. Benefiting from MDE, we also develop supplementary distances resolving the above-mentioned limitations of TransE. We further propose an extended loss function for distance based embeddings and show that MDE and TransE are fully expressive using this loss function. Furthermore, we obtain a bound on the size of their embeddings for full expressivity. Our empirical results show that MDE significantly improves the translating embeddings and outperforms several state-of-the-art embedding models on benchmark datasets.

* Preprint. Under review

Via

Access Paper or Ask Questions

Soft Marginal TransE for Scholarly Knowledge Graph Completion

Apr 27, 2019

Mojtaba Nayyeri, Sahar Vahdati, Jens Lehmann, Hamed Shariat Yazdi

Figure 1 for Soft Marginal TransE for Scholarly Knowledge Graph Completion

Figure 2 for Soft Marginal TransE for Scholarly Knowledge Graph Completion

Figure 3 for Soft Marginal TransE for Scholarly Knowledge Graph Completion

Figure 4 for Soft Marginal TransE for Scholarly Knowledge Graph Completion

Abstract:Knowledge graphs (KGs), i.e. representation of information as a semantic graph, provide a significant test bed for many tasks including question answering, recommendation, and link prediction. Various amount of scholarly metadata have been made vailable as knowledge graphs from the diversity of data providers and agents. However, these high-quantities of data remain far from quality criteria in terms of completeness while growing at a rapid pace. Most of the attempts in completing such KGs are following traditional data digitization, harvesting and collaborative curation approaches. Whereas, advanced AI-related approaches such as embedding models - specifically designed for such tasks - are usually evaluated for standard benchmarks such as Freebase and Wordnet. The tailored nature of such datasets prevents those approaches to shed the lights on more accurate discoveries. Application of such models on domain-specific KGs takes advantage of enriched meta-data and provides accurate results where the underlying domain can enormously benefit. In this work, the TransE embedding model is reconciled for a specific link prediction task on scholarly metadata. The results show a significant shift in the accuracy and performance evaluation of the model on a dataset with scholarly metadata. The newly proposed version of TransE obtains 99.9% for link prediction task while original TransE gets 95%. In terms of accuracy and Hit@10, TransE outperforms other embedding models such as ComplEx, TransH and TransR experimented over scholarly knowledge graphs

Via

Access Paper or Ask Questions

Translating Natural Language to SQL using Pointer-Generator Networks and How Decoding Order Matters

Nov 13, 2018

Denis Lukovnikov, Nilesh Chakraborty, Jens Lehmann, Asja Fischer

Figure 1 for Translating Natural Language to SQL using Pointer-Generator Networks and How Decoding Order Matters

Figure 2 for Translating Natural Language to SQL using Pointer-Generator Networks and How Decoding Order Matters

Figure 3 for Translating Natural Language to SQL using Pointer-Generator Networks and How Decoding Order Matters

Figure 4 for Translating Natural Language to SQL using Pointer-Generator Networks and How Decoding Order Matters

Abstract:Translating natural language to SQL queries for table-based question answering is a challenging problem and has received significant attention from the research community. In this work, we extend a pointer-generator and investigate the order-matters problem in semantic parsing for SQL. Even though our model is a straightforward extension of a general-purpose pointer-generator, it outperforms early works for WikiSQL and remains competitive to concurrently introduced, more complex models. Moreover, we provide a deeper investigation of the potential order-matters problem that could arise due to having multiple correct decoding paths, and investigate the use of REINFORCE as well as a dynamic oracle in this context.

Via

Access Paper or Ask Questions

Improving Response Selection in Multi-Turn Dialogue Systems by Incorporating Domain Knowledge

Nov 05, 2018

Debanjan Chaudhuri, Agustinus Kristiadi, Jens Lehmann, Asja Fischer

Figure 1 for Improving Response Selection in Multi-Turn Dialogue Systems by Incorporating Domain Knowledge

Figure 2 for Improving Response Selection in Multi-Turn Dialogue Systems by Incorporating Domain Knowledge

Figure 3 for Improving Response Selection in Multi-Turn Dialogue Systems by Incorporating Domain Knowledge

Figure 4 for Improving Response Selection in Multi-Turn Dialogue Systems by Incorporating Domain Knowledge

Abstract:Building systems that can communicate with humans is a core problem in Artificial Intelligence. This work proposes a novel neural network architecture for response selection in an end-to-end multi-turn conversational dialogue setting. The architecture applies context level attention and incorporates additional external knowledge provided by descriptions of domain-specific words. It uses a bi-directional Gated Recurrent Unit (GRU) for encoding context and responses and learns to attend over the context words given the latent response representation and vice versa.In addition, it incorporates external domain specific information using another GRU for encoding the domain keyword descriptions. This allows better representation of domain-specific keywords in responses and hence improves the overall performance. Experimental results show that our model outperforms all other state-of-the-art methods for response selection in multi-turn conversations.

* Published as conference paper at CoNLL 2018

Via

Access Paper or Ask Questions

Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Nov 02, 2018

Gaurav Maheshwari, Priyansh Trivedi, Denis Lukovnikov, Nilesh Chakraborty, Asja Fischer, Jens Lehmann

Figure 1 for Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Figure 2 for Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Figure 3 for Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Figure 4 for Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Abstract:In this paper, we conduct an empirical investigation of neural query graph ranking approaches for the task of complex question answering over knowledge graphs. We experiment with six different ranking models and propose a novel self-attention based slot matching model which exploits the inherent structure of query graphs, our logical form of choice. Our proposed model generally outperforms the other models on two QA datasets over the DBpedia knowledge graph, evaluated in different settings. In addition, we show that transfer learning from the larger of those QA datasets to the smaller dataset yields substantial improvements, effectively offsetting the general lack of training data.

Via

Access Paper or Ask Questions

No One is Perfect: Analysing the Performance of Question Answering Components over the DBpedia Knowledge Graph

Sep 26, 2018

Kuldeep Singh, Ioanna Lytra, Arun Sethupat Radhakrishna, Saeedeh Shekarpour, Maria-Esther Vidal, Jens Lehmann

Figure 1 for No One is Perfect: Analysing the Performance of Question Answering Components over the DBpedia Knowledge Graph

Figure 2 for No One is Perfect: Analysing the Performance of Question Answering Components over the DBpedia Knowledge Graph

Figure 3 for No One is Perfect: Analysing the Performance of Question Answering Components over the DBpedia Knowledge Graph

Figure 4 for No One is Perfect: Analysing the Performance of Question Answering Components over the DBpedia Knowledge Graph

Abstract:Question answering (QA) over knowledge graphs has gained significant momentum over the past five years due to the increasing availability of large knowledge graphs and the rising importance of question answering for user interaction. DBpedia has been the most prominently used knowledge graph in this setting and most approaches currently use a pipeline of processing steps connecting a sequence of components. In this article, we analyse and micro evaluate the behaviour of 29 available QA components for DBpedia knowledge graph that were released by the research community since 2010. As a result, we provide a perspective on collective failure cases, suggest characteristics of QA components that prevent them from performing better and provide future challenges and research directions for the field.

* Evaluation of State of the art Question Answering components performing entity linking, relation linking etc

Via

Access Paper or Ask Questions

Belittling the Source: Trustworthiness Indicators to Obfuscate Fake News on the Web

Sep 03, 2018

Diego Esteves, Aniketh Janardhan Reddy, Piyush Chawla, Jens Lehmann

Figure 1 for Belittling the Source: Trustworthiness Indicators to Obfuscate Fake News on the Web

Figure 2 for Belittling the Source: Trustworthiness Indicators to Obfuscate Fake News on the Web

Figure 3 for Belittling the Source: Trustworthiness Indicators to Obfuscate Fake News on the Web

Figure 4 for Belittling the Source: Trustworthiness Indicators to Obfuscate Fake News on the Web

Abstract:With the growth of the internet, the number of fake-news online has been proliferating every year. The consequences of such phenomena are manifold, ranging from lousy decision-making process to bullying and violence episodes. Therefore, fact-checking algorithms became a valuable asset. To this aim, an important step to detect fake-news is to have access to a credibility score for a given information source. However, most of the widely used Web indicators have either been shut-down to the public (e.g., Google PageRank) or are not free for use (Alexa Rank). Further existing databases are short-manually curated lists of online sources, which do not scale. Finally, most of the research on the topic is theoretical-based or explore confidential data in a restricted simulation environment. In this paper we explore current research, highlight the challenges and propose solutions to tackle the problem of classifying websites into a credibility scale. The proposed model automatically extracts source reputation cues and computes a credibility factor, providing valuable insights which can help in belittling dubious and confirming trustful unknown websites. Experimental results outperform state of the art in the 2-classes and 5-classes setting.

* EMNLP 2018: Conference on Empirical Methods in Natural Language Processing (The First Workshop on Fact Extraction and Verification)

Via

Access Paper or Ask Questions

How Complex is your classification problem? A survey on measuring classification complexity

Aug 10, 2018

Ana C. Lorena, Luís P. F. Garcia, Jens Lehmann, Marcilio C. P. Souto, Tin K. Ho

Figure 1 for How Complex is your classification problem? A survey on measuring classification complexity

Figure 2 for How Complex is your classification problem? A survey on measuring classification complexity

Figure 3 for How Complex is your classification problem? A survey on measuring classification complexity

Figure 4 for How Complex is your classification problem? A survey on measuring classification complexity

Abstract:Extracting characteristics from the training datasets of classification problems has proven effective in a number of meta-analyses. Among them, measures of classification complexity can estimate the difficulty in separating the data points into their expected classes. Descriptors of the spatial distribution of the data and estimates of the shape and size of the decision boundary are among the existent measures for this characterization. This information can support the formulation of new data-driven pre-processing and pattern recognition techniques, which can in turn be focused on challenging characteristics of the problems. This paper surveys and analyzes measures which can be extracted from the training datasets in order to characterize the complexity of the respective classification problems. Their use in recent literature is also reviewed and discussed, allowing to prospect opportunities for future work in the area. Finally, descriptions are given on an R package named Extended Complexity Library (ECoL) that implements a set of complexity measures and is made publicly available.

* Survey paper, 27 pages, 12 figures

Via

Access Paper or Ask Questions

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs

Jun 25, 2018

Mohnish Dubey, Debayan Banerjee, Debanjan Chaudhuri, Jens Lehmann

Figure 1 for EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs

Figure 2 for EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs

Figure 3 for EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs

Figure 4 for EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs

Abstract:Many question answering systems over knowledge graphs rely on entity and relation linking components in order to connect the natural language input to the underlying knowledge graph. Traditionally, entity linking and relation linking have been performed either as dependent sequential tasks or as independent parallel tasks. In this paper, we propose a framework called EARL, which performs entity linking and relation linking as a joint task. EARL implements two different solution strategies for which we provide a comparative analysis in this paper: The first strategy is a formalisation of the joint entity and relation linking tasks as an instance of the Generalised Travelling Salesman Problem (GTSP). In order to be computationally feasible, we employ approximate GTSP solvers. The second strategy uses machine learning in order to exploit the connection density between nodes in the knowledge graph. It relies on three base features and re-ranking steps in order to predict entities and relations. We compare the strategies and evaluate them on a dataset with 5000 questions. Both strategies significantly outperform the current state-of-the-art approaches for entity and relation linking.

* International Semantic Web Conference 2018

Via

Access Paper or Ask Questions

Incorporating Literals into Knowledge Graph Embeddings

May 25, 2018

Agustinus Kristiadi, Mohammad Asif Khan, Denis Lukovnikov, Jens Lehmann, Asja Fischer

Figure 1 for Incorporating Literals into Knowledge Graph Embeddings

Figure 2 for Incorporating Literals into Knowledge Graph Embeddings

Figure 3 for Incorporating Literals into Knowledge Graph Embeddings

Figure 4 for Incorporating Literals into Knowledge Graph Embeddings

Abstract:Knowledge graphs, on top of entities and their relationships, contain other important elements: literals. Literals encode interesting properties (e.g. the height) of entities that are not captured by links between entities alone. Most of the existing work on embedding (or latent feature) based knowledge graph analysis focuses mainly on the relations between entities. In this work, we study the effect of incorporating literal information into existing link prediction methods. Our approach, which we name LiteralE, is an extension that can be plugged into existing latent feature methods. LiteralE merges entity embeddings with their literal information using a learnable, parametrized function, such as a simple linear or nonlinear transformation, or a multilayer neural network. We extend several popular embedding models based on LiteralE and evaluate their performance on the task of link prediction. Despite its simplicity, LiteralE proves to be an effective way to incorporate literal information into existing embedding based methods, improving their performance on different standard datasets, which we augmented with their literals and provide as testbed for further research.

* 9 pages, 2 figures, 6 tables

Via

Access Paper or Ask Questions