Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Axel-Cyrille Ngonga Ngomo

Data Science Group, Paderborn University, Germany

Convolutional Complex Knowledge Graph Embeddings

Aug 10, 2020

Caglar Demir, Axel-Cyrille Ngonga Ngomo

Figure 1 for Convolutional Complex Knowledge Graph Embeddings

Figure 2 for Convolutional Complex Knowledge Graph Embeddings

Figure 3 for Convolutional Complex Knowledge Graph Embeddings

Figure 4 for Convolutional Complex Knowledge Graph Embeddings

Abstract:In this paper, we study the problem of learning continuous vector representations of knowledge graphs for predicting missing links. We present a new approach called ConEx, which infers missing links by leveraging the composition of a 2D convolution with a Hermitian inner product of complex-valued embedding vectors. We evaluate ConEx against state-of-the-art approaches on the WN18RR, FB15K-237, KINSHIP and UMLS benchmark datasets. Our experimental results show that ConEx achieves a performance superior to that of state-of-the-art approaches such as RotatE, QuatE and TuckER on the link prediction task on all datasets while requiring at least 8 times fewer parameters. We ensure the reproducibility of our results by providing an open-source implementation which includes the training, evaluation scripts along with pre-trained models at https://github.com/conex-kge/ConEx.

Via

Access Paper or Ask Questions

Knowledge Graphs

Mar 28, 2020

Aidan Hogan, Eva Blomqvist, Michael Cochez, Claudia d'Amato, Gerard de Melo, Claudio Gutierrez, José Emilio Labra Gayo, Sabrina Kirrane, Sebastian Neumaier, Axel Polleres(+8 more)

Abstract:In this paper we provide a comprehensive introduction to knowledge graphs, which have recently garnered significant attention from both industry and academia in scenarios that require exploiting diverse, dynamic, large-scale collections of data. After a general introduction, we motivate and contrast various graph-based data models and query languages that are used for knowledge graphs. We discuss the roles of schema, identity, and context in knowledge graphs. We explain how knowledge can be represented and extracted using a combination of deductive and inductive techniques. We summarise methods for the creation, enrichment, quality assessment, refinement, and publication of knowledge graphs. We provide an overview of prominent open knowledge graphs and enterprise knowledge graphs, their applications, and how they use the aforementioned techniques. We conclude with high-level future research directions for knowledge graphs.

* Revision from previous version: - Fixing flight companies in Figure 3 and changing some other details - Giving Figure 4 analogous data to Figure 3 for easier comparison - Updating discussion of the figures in Section 2.1.3. - Updating Example B.6 to reflect the new Figure 4. - Minor formatting change for Figure 27

Via

Access Paper or Ask Questions

A Physical Embedding Model for Knowledge Graphs

Jan 21, 2020

Caglar Demir, Axel-Cyrille Ngonga Ngomo

Figure 1 for A Physical Embedding Model for Knowledge Graphs

Figure 2 for A Physical Embedding Model for Knowledge Graphs

Figure 3 for A Physical Embedding Model for Knowledge Graphs

Figure 4 for A Physical Embedding Model for Knowledge Graphs

Abstract:Knowledge graph embedding methods learn continuous vector representations for entities in knowledge graphs and have been used successfully in a large number of applications. We present a novel and scalable paradigm for the computation of knowledge graph embeddings, which we dub PYKE . Our approach combines a physical model based on Hooke's law and its inverse with ideas from simulated annealing to compute embeddings for knowledge graphs efficiently. We prove that PYKE achieves a linear space complexity. While the time complexity for the initialization of our approach is quadratic, the time complexity of each of its iterations is linear in the size of the input knowledge graph. Hence, PYKE's overall runtime is close to linear. Consequently, our approach easily scales up to knowledge graphs containing millions of triples. We evaluate our approach against six state-of-the-art embedding approaches on the DrugBank and DBpedia datasets in two series of experiments. The first series shows that the cluster purity achieved by PYKE is up to 26% (absolute) better than that of the state of art. In addition, PYKE is more than 22 times faster than existing embedding solutions in the best case. The results of our second series of experiments show that PYKE is up to 23% (absolute) better than the state of art on the task of type prediction while maintaining its superior scalability. Our implementation and results are open-source and are available at http://github.com/dice-group/PYKE.

* 9th Joint International Conference, JIST 2019, Hangzhou, China

Via

Access Paper or Ask Questions

A Holistic Natural Language Generation Framework for the Semantic Web

Nov 04, 2019

Axel-Cyrille Ngonga Ngomo, Diego Moussallem, Lorenz Bühmann

Figure 1 for A Holistic Natural Language Generation Framework for the Semantic Web

Figure 2 for A Holistic Natural Language Generation Framework for the Semantic Web

Abstract:With the ever-growing generation of data for the Semantic Web comes an increasing demand for this data to be made available to non-semantic Web experts. One way of achieving this goal is to translate the languages of the Semantic Web into natural language. We present LD2NL, a framework for verbalizing the three key languages of the Semantic Web, i.e., RDF, OWL, and SPARQL. Our framework is based on a bottom-up approach to verbalization. We evaluated LD2NL in an open survey with 86 persons. Our results suggest that our framework can generate verbalizations that are close to natural languages and that can be easily understood by non-experts. Therewith, it enables non-domain experts to interpret Semantic Web data with more than 91\% of the accuracy of domain experts.

* International Conference Recent Advances in Natural Language Processing

Via

Access Paper or Ask Questions

Semantic Web for Machine Translation: Challenges and Directions

Jul 23, 2019

Diego Moussallem, Matthias Wauer, Axel-Cyrille Ngonga Ngomo

Abstract:A large number of machine translation approaches have recently been developed to facilitate the fluid migration of content across languages. However, the literature suggests that many obstacles must still be dealt with to achieve better automatic translations. One of these obstacles is lexical and syntactic ambiguity. A promising way of overcoming this problem is using Semantic Web technologies. This article is an extended abstract of our systematic review on machine translation approaches that rely on Semantic Web technologies for improving the translation of texts. Overall, we present the challenges and opportunities in the use of Semantic Web technologies in Machine Translation. Moreover, our research suggests that while Semantic Web technologies can enhance the quality of machine translation outputs for various problems, the combination of both is still in its infancy.

* Accepted at the Journal track of International Semantic Web conference (ISWC) 2019. arXiv admin note: substantial text overlap with arXiv:1711.09476

Via

Access Paper or Ask Questions

Augmenting Neural Machine Translation with Knowledge Graphs

Feb 23, 2019

Diego Moussallem, Mihael Arčan, Axel-Cyrille Ngonga Ngomo, Paul Buitelaar

Figure 1 for Augmenting Neural Machine Translation with Knowledge Graphs

Figure 2 for Augmenting Neural Machine Translation with Knowledge Graphs

Figure 3 for Augmenting Neural Machine Translation with Knowledge Graphs

Figure 4 for Augmenting Neural Machine Translation with Knowledge Graphs

Abstract:While neural networks have been used extensively to make substantial progress in the machine translation task, they are known for being heavily dependent on the availability of large amounts of training data. Recent efforts have tried to alleviate the data sparsity problem by augmenting the training data using different strategies, such as back-translation. Along with the data scarcity, the out-of-vocabulary words, mostly entities and terminological expressions, pose a difficult challenge to Neural Machine Translation systems. In this paper, we hypothesize that knowledge graphs enhance the semantic feature extraction of neural models, thus optimizing the translation of entities and terminological expressions in texts and consequently leading to a better translation quality. We hence investigate two different strategies for incorporating knowledge graphs into neural models without modifying the neural network architectures. We also examine the effectiveness of our augmentation method to recurrent and non-recurrent (self-attentional) neural architectures. Our knowledge graph augmented neural translation model, dubbed KG-NMT, achieves significant and consistent improvements of +3 BLEU, METEOR and chrF3 on average on the newstest datasets between 2014 and 2018 for WMT English-German translation task.

Via

Access Paper or Ask Questions

BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking

Nov 01, 2018

Axel-Cyrille Ngonga Ngomo, Michael Röder, Diego Moussallem, Ricardo Usbeck, René Speck

Figure 1 for BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking

Figure 2 for BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking

Figure 3 for BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking

Figure 4 for BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking

Abstract:The manual creation of gold standards for named entity recognition and entity linking is time- and resource-intensive. Moreover, recent works show that such gold standards contain a large proportion of mistakes in addition to being difficult to maintain. We hence present BENGAL, a novel automatic generation of such gold standards as a complement to manually created benchmarks. The main advantage of our benchmarks is that they can be readily generated at any time. They are also cost-effective while being guaranteed to be free of annotation errors. We compare the performance of 11 tools on benchmarks in English generated by BENGAL and on 16benchmarks created manually. We show that our approach can be ported easily across languages by presenting results achieved by 4 tools on both Brazilian Portuguese and Spanish. Overall, our results suggest that our automatic benchmark generation approach can create varied benchmarks that have characteristics similar to those of existing benchmarks. Our approach is open-source. Our experimental results are available at http://faturl.com/bengalexpinlg and the code at https://github.com/dice-group/BENGAL.

* Accepted at INLG 2018

Via

Access Paper or Ask Questions

Machine Translation using Semantic Web Technologies: A Survey

Jul 17, 2018

Diego Moussallem, Matthias Wauer, Axel-Cyrille Ngonga Ngomo

Figure 1 for Machine Translation using Semantic Web Technologies: A Survey

Figure 2 for Machine Translation using Semantic Web Technologies: A Survey

Figure 3 for Machine Translation using Semantic Web Technologies: A Survey

Figure 4 for Machine Translation using Semantic Web Technologies: A Survey

Abstract:A large number of machine translation approaches have recently been developed to facilitate the fluid migration of content across languages. However, the literature suggests that many obstacles must still be dealt with to achieve better automatic translations. One of these obstacles is lexical and syntactic ambiguity. A promising way of overcoming this problem is using Semantic Web technologies. This article presents the results of a systematic review of machine translation approaches that rely on Semantic Web technologies for translating texts. Overall, our survey suggests that while Semantic Web technologies can enhance the quality of machine translation outputs for various problems, the combination of both is still in its infancy.

* 23 pages, 2 figures, 4 tables

Via

Access Paper or Ask Questions

Entity Linking in 40 Languages using MAG

May 29, 2018

Diego Moussallem, Ricardo Usbeck, Michael Röder, Axel-Cyrille Ngonga Ngomo

Figure 1 for Entity Linking in 40 Languages using MAG

Figure 2 for Entity Linking in 40 Languages using MAG

Figure 3 for Entity Linking in 40 Languages using MAG

Abstract:A plethora of Entity Linking (EL) approaches has recently been developed. While many claim to be multilingual, the MAG (Multilingual AGDISTIS) approach has been shown recently to outperform the state of the art in multilingual EL on 7 languages. With this demo, we extend MAG to support EL in 40 different languages, including especially low-resources languages such as Ukrainian, Greek, Hungarian, Croatian, Portuguese, Japanese and Korean. Our demo relies on online web services which allow for an easy access to our entity linking approaches and can disambiguate against DBpedia and Wikidata. During the demo, we will show how to use MAG by means of POST requests as well as using its user-friendly web interface. All data used in the demo is available at https://hobbitdata.informatik.uni-leipzig.de/agdistis/

* Accepted at ESWC 2018

Via

Access Paper or Ask Questions

Expeditious Generation of Knowledge Graph Embeddings

Mar 21, 2018

Tommaso Soru, Stefano Ruberto, Diego Moussallem, Edgard Marx, Diego Esteves, Axel-Cyrille Ngonga Ngomo

Figure 1 for Expeditious Generation of Knowledge Graph Embeddings

Figure 2 for Expeditious Generation of Knowledge Graph Embeddings

Figure 3 for Expeditious Generation of Knowledge Graph Embeddings

Figure 4 for Expeditious Generation of Knowledge Graph Embeddings

Abstract:Knowledge Graph Embedding methods aim at representing entities and relations in a knowledge base as points or vectors in a continuous vector space. Several approaches using embeddings have shown promising results on tasks such as link prediction, entity recommendation, question answering, and triplet classification. However, only a few methods can compute low-dimensional embeddings of very large knowledge bases. In this paper, we propose KG2Vec, a novel approach to Knowledge Graph Embedding based on the skip-gram model. Instead of using a predefined scoring function, we learn it relying on Long Short-Term Memories. We evaluated the goodness of our embeddings on knowledge graph completion and show that KG2Vec is comparable to the quality of the scalable state-of-the-art approaches and can process large graphs by parsing more than a hundred million triples in less than 6 hours on common hardware.

* Submitted, 6 pages

Via

Access Paper or Ask Questions