Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nandana Mihindukulasooriya

Scaling Knowledge Graphs for Automating AI of Digital Twins

Oct 26, 2022

Joern Ploennigs, Konstantinos Semertzidis, Fabio Lorenzi, Nandana Mihindukulasooriya

Abstract:Digital Twins are digital representations of systems in the Internet of Things (IoT) that are often based on AI models that are trained on data from those systems. Semantic models are used increasingly to link these datasets from different stages of the IoT systems life-cycle together and to automatically configure the AI modelling pipelines. This combination of semantic models with AI pipelines running on external datasets raises unique challenges particular if rolled out at scale. Within this paper we will discuss the unique requirements of applying semantic graphs to automate Digital Twins in different practical use cases. We will introduce the benchmark dataset DTBM that reflects these characteristics and look into the scaling challenges of different knowledge graph technologies. Based on these insights we will propose a reference architecture that is in-use in multiple products in IBM and derive lessons learned for scaling knowledge graphs for configuring AI models for Digital Twins.

* Accepted at ISWC 2022

Via

Access Paper or Ask Questions

KnowGL: Knowledge Generation and Linking from Text

Oct 25, 2022

Gaetano Rossiello, Faisal Chowdhury, Nandana Mihindukulasooriya, Owen Cornec, Alfio Gliozzo

Figure 1 for KnowGL: Knowledge Generation and Linking from Text

Abstract:We propose KnowGL, a tool that allows converting text into structured relational data represented as a set of ABox assertions compliant with the TBox of a given Knowledge Graph (KG), such as Wikidata. We address this problem as a sequence generation task by leveraging pre-trained sequence-to-sequence language models, e.g. BART. Given a sentence, we fine-tune such models to detect pairs of entity mentions and jointly generate a set of facts consisting of the full set of semantic annotations for a KG, such as entity labels, entity types, and their relationships. To showcase the capabilities of our tool, we build a web application consisting of a set of UI widgets that help users to navigate through the semantic data extracted from a given input text. We make the KnowGL model available at https://huggingface.co/ibm/knowgl-large.

* AAAI-23 Demo Track

Via

Access Paper or Ask Questions

Knowledge Graph Induction enabling Recommending and Trend Analysis: A Corporate Research Community Use Case

Jul 18, 2022

Nandana Mihindukulasooriya, Mike Sava, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Irene Yachbes, Aditya Gidh, Jillian Duckwitz, Kovit Nisar, Michael Santos, Alfio Gliozzo

Figure 1 for Knowledge Graph Induction enabling Recommending and Trend Analysis: A Corporate Research Community Use Case

Figure 2 for Knowledge Graph Induction enabling Recommending and Trend Analysis: A Corporate Research Community Use Case

Figure 3 for Knowledge Graph Induction enabling Recommending and Trend Analysis: A Corporate Research Community Use Case

Figure 4 for Knowledge Graph Induction enabling Recommending and Trend Analysis: A Corporate Research Community Use Case

Abstract:A research division plays an important role of driving innovation in an organization. Drawing insights, following trends, keeping abreast of new research, and formulating strategies are increasingly becoming more challenging for both researchers and executives as the amount of information grows in both velocity and volume. In this paper we present a use case of how a corporate research community, IBM Research, utilizes Semantic Web technologies to induce a unified Knowledge Graph from both structured and textual data obtained by integrating various applications used by the community related to research projects, academic papers, datasets, achievements and recognition. In order to make the Knowledge Graph more accessible to application developers, we identified a set of common patterns for exploiting the induced knowledge and exposed them as APIs. Those patterns were born out of user research which identified the most valuable use cases or user pain points to be alleviated. We outline two distinct scenarios: recommendation and analytics for business use. We will discuss these scenarios in detail and provide an empirical evaluation on entity recommendation specifically. The methodology used and the lessons learned from this work can be applied to other organizations facing similar challenges.

* Accepted at ISWC 2022

Via

Access Paper or Ask Questions

CBR-iKB: A Case-Based Reasoning Approach for Question Answering over Incomplete Knowledge Bases

Apr 18, 2022

Dung Thai, Srinivas Ravishankar, Ibrahim Abdelaziz, Mudit Chaudhary, Nandana Mihindukulasooriya, Tahira Naseem, Rajarshi Das, Pavan Kapanipathi, Achille Fokoue, Andrew McCallum

Figure 1 for CBR-iKB: A Case-Based Reasoning Approach for Question Answering over Incomplete Knowledge Bases

Figure 2 for CBR-iKB: A Case-Based Reasoning Approach for Question Answering over Incomplete Knowledge Bases

Figure 3 for CBR-iKB: A Case-Based Reasoning Approach for Question Answering over Incomplete Knowledge Bases

Figure 4 for CBR-iKB: A Case-Based Reasoning Approach for Question Answering over Incomplete Knowledge Bases

Abstract:Knowledge bases (KBs) are often incomplete and constantly changing in practice. Yet, in many question answering applications coupled with knowledge bases, the sparse nature of KBs is often overlooked. To this end, we propose a case-based reasoning approach, CBR-iKB, for knowledge base question answering (KBQA) with incomplete-KB as our main focus. Our method ensembles decisions from multiple reasoning chains with a novel nonparametric reasoning algorithm. By design, CBR-iKB can seamlessly adapt to changes in KBs without any task-specific training or fine-tuning. Our method achieves 100% accuracy on MetaQA and establishes new state-of-the-art on multiple benchmarks. For instance, CBR-iKB achieves an accuracy of 70% on WebQSP under the incomplete-KB setting, outperforming the existing state-of-the-art method by 22.3%.

* 8 pages, 3 figurs, 4 tables

Via

Access Paper or Ask Questions

KGI: An Integrated Framework for Knowledge Intensive Language Tasks

Apr 08, 2022

Md Faisal Mahbub Chowdhury, Michael Glass, Gaetano Rossiello, Alfio Gliozzo, Nandana Mihindukulasooriya

Figure 1 for KGI: An Integrated Framework for Knowledge Intensive Language Tasks

Figure 2 for KGI: An Integrated Framework for Knowledge Intensive Language Tasks

Figure 3 for KGI: An Integrated Framework for Knowledge Intensive Language Tasks

Figure 4 for KGI: An Integrated Framework for Knowledge Intensive Language Tasks

Abstract:In a recent work, we presented a novel state-of-the-art approach to zero-shot slot filling that extends dense passage retrieval with hard negatives and robust training procedures for retrieval augmented generation models. In this paper, we propose a system based on an enhanced version of this approach where we train task specific models for other knowledge intensive language tasks, such as open domain question answering (QA), dialogue and fact checking. Our system achieves results comparable to the best models in the KILT leaderboards. Moreover, given a user query, we show how the output from these different models can be combined to cross-examine each other. Particularly, we show how accuracy in dialogue can be improved using the QA model. A short video demonstrating the system is available here - \url{https://ibm.box.com/v/kgi-interactive-demo} .

Via

Access Paper or Ask Questions

A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

Jan 15, 2022

Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus(+15 more)

Figure 1 for A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

Figure 2 for A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

Figure 3 for A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

Figure 4 for A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

Abstract:Knowledge Base Question Answering (KBQA) tasks that involve complex reasoning are emerging as an important research direction. However, most existing KBQA datasets focus primarily on generic multi-hop reasoning over explicit facts, largely ignoring other reasoning types such as temporal, spatial, and taxonomic reasoning. In this paper, we present a benchmark dataset for temporal reasoning, TempQA-WD, to encourage research in extending the present approaches to target a more challenging set of complex reasoning tasks. Specifically, our benchmark is a temporal question answering dataset with the following advantages: (a) it is based on Wikidata, which is the most frequently curated, openly available knowledge base, (b) it includes intermediate sparql queries to facilitate the evaluation of semantic parsing based approaches for KBQA, and (c) it generalizes to multiple knowledge bases: Freebase and Wikidata. The TempQA-WD dataset is available at https://github.com/IBM/tempqa-wd.

* 7 pages, 2 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2109.13430

Via

Access Paper or Ask Questions

Applying a Generic Sequence-to-Sequence Model for Simple and Effective Keyphrase Generation

Jan 14, 2022

Md Faisal Mahbub Chowdhury, Gaetano Rossiello, Michael Glass, Nandana Mihindukulasooriya, Alfio Gliozzo

Figure 1 for Applying a Generic Sequence-to-Sequence Model for Simple and Effective Keyphrase Generation

Figure 2 for Applying a Generic Sequence-to-Sequence Model for Simple and Effective Keyphrase Generation

Figure 3 for Applying a Generic Sequence-to-Sequence Model for Simple and Effective Keyphrase Generation

Abstract:In recent years, a number of keyphrase generation (KPG) approaches were proposed consisting of complex model architectures, dedicated training paradigms and decoding strategies. In this work, we opt for simplicity and show how a commonly used seq2seq language model, BART, can be easily adapted to generate keyphrases from the text in a single batch computation using a simple training procedure. Empirical results on five benchmarks show that our approach is as good as the existing state-of-the-art KPG systems, but using a much simpler and easy to deploy framework.

Via

Access Paper or Ask Questions

Semantic Answer Type and Relation Prediction Task (SMART 2021)

Jan 10, 2022

Nandana Mihindukulasooriya, Mohnish Dubey, Alfio Gliozzo, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Ricardo Usbeck, Gaetano Rossiello, Uttam Kumar

Figure 1 for Semantic Answer Type and Relation Prediction Task (SMART 2021)

Figure 2 for Semantic Answer Type and Relation Prediction Task (SMART 2021)

Figure 3 for Semantic Answer Type and Relation Prediction Task (SMART 2021)

Figure 4 for Semantic Answer Type and Relation Prediction Task (SMART 2021)

Abstract:Each year the International Semantic Web Conference organizes a set of Semantic Web Challenges to establish competitions that will advance state-of-the-art solutions in some problem domains. The Semantic Answer Type and Relation Prediction Task (SMART) task is one of the ISWC 2021 Semantic Web challenges. This is the second year of the challenge after a successful SMART 2020 at ISWC 2020. This year's version focuses on two sub-tasks that are very important to Knowledge Base Question Answering (KBQA): Answer Type Prediction and Relation Prediction. Question type and answer type prediction can play a key role in knowledge base question answering systems providing insights about the expected answer that are helpful to generate correct queries or rank the answer candidates. More concretely, given a question in natural language, the first task is, to predict the answer type using a target ontology (e.g., DBpedia or Wikidata. Similarly, the second task is to identify relations in the natural language query and link them to the relations in a target ontology. This paper discusses the task descriptions, benchmark datasets, and evaluation metrics. For more information, please visit https://smart-task.github.io/2021/.

Via

Access Paper or Ask Questions

Learning to Transpile AMR into SPARQL

Dec 15, 2021

Mihaela Bornea, Ramon Fernandez Astudillo, Tahira Naseem, Nandana Mihindukulasooriya, Ibrahim Abdelaziz, Pavan Kapanipathi, Radu Florian, Salim Roukos

Figure 1 for Learning to Transpile AMR into SPARQL

Figure 2 for Learning to Transpile AMR into SPARQL

Figure 3 for Learning to Transpile AMR into SPARQL

Figure 4 for Learning to Transpile AMR into SPARQL

Abstract:We propose a transition-based system to transpile Abstract Meaning Representation (AMR) into SPARQL for Knowledge Base Question Answering (KBQA). This allows to delegate part of the abstraction problem to a strongly pre-trained semantic parser, while learning transpiling with small amount of paired data. We departure from recent work relating AMR and SPARQL constructs, but rather than applying a set of rules, we teach the BART model to selectively use these relations. Further, we avoid explicitly encoding AMR but rather encode the parser state in the attention mechanism of BART, following recent semantic parsing works. The resulting model is simple, provides supporting text for its decisions, and outperforms recent progress in AMR-based KBQA in LC-QuAD (F1 53.4), matching it in QALD (F1 30.8), while exploiting the same inductive biases.

Via

Access Paper or Ask Questions

SYGMA: System for Generalizable Modular Question Answering OverKnowledge Bases

Sep 28, 2021

Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus(+14 more)

Figure 1 for SYGMA: System for Generalizable Modular Question Answering OverKnowledge Bases

Figure 2 for SYGMA: System for Generalizable Modular Question Answering OverKnowledge Bases

Figure 3 for SYGMA: System for Generalizable Modular Question Answering OverKnowledge Bases

Figure 4 for SYGMA: System for Generalizable Modular Question Answering OverKnowledge Bases

Abstract:Knowledge Base Question Answering (KBQA) tasks that in-volve complex reasoning are emerging as an important re-search direction. However, most KBQA systems struggle withgeneralizability, particularly on two dimensions: (a) acrossmultiple reasoning types where both datasets and systems haveprimarily focused on multi-hop reasoning, and (b) across mul-tiple knowledge bases, where KBQA approaches are specif-ically tuned to a single knowledge base. In this paper, wepresent SYGMA, a modular approach facilitating general-izability across multiple knowledge bases and multiple rea-soning types. Specifically, SYGMA contains three high levelmodules: 1) KB-agnostic question understanding module thatis common across KBs 2) Rules to support additional reason-ing types and 3) KB-specific question mapping and answeringmodule to address the KB-specific aspects of the answer ex-traction. We demonstrate effectiveness of our system by evalu-ating on datasets belonging to two distinct knowledge bases,DBpedia and Wikidata. In addition, to demonstrate extensi-bility to additional reasoning types we evaluate on multi-hopreasoning datasets and a new Temporal KBQA benchmarkdataset on Wikidata, namedTempQA-WD1, introduced in thispaper. We show that our generalizable approach has bettercompetetive performance on multiple datasets on DBpediaand Wikidata that requires both multi-hop and temporal rea-soning

Via

Access Paper or Ask Questions