Sören Auer

Exploring the Latest LLMs for Leaderboard Extraction

Jun 06, 2024

Salomon Kabongo, Jennifer D'Souza, Sören Auer

Abstract: The rapid advancements in Large Language Models (LLMs) have opened new avenues for automating complex tasks in AI research. This paper investigates the efficacy of different LLMs (Mistral 7B, Llama-2, GPT-4-Turbo, and GPT-4.o) in extracting leaderboard information from empirical AI research articles. We explore three types of contextual inputs to the models: DocTAET (Document Title, Abstract, Experimental Setup, and Tabular Information), DocREC (Results, Experiments, and Conclusions), and DocFULL (entire document). Our comprehensive study evaluates the performance of these models in generating (Task, Dataset, Metric, Score) quadruples from research papers. The findings reveal significant insights into the strengths and limitations of each model and context type, providing valuable guidance for future AI research automation efforts.
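
To make the (Task, Dataset, Metric, Score) output format concrete, here is a minimal Python sketch of how such quadruples could be represented and compared against gold annotations. The `Quadruple` type, the `exact_match_f1` function, and the sample values are illustrative assumptions; the abstract does not state which evaluation metrics the paper uses.

```python
from typing import NamedTuple


class Quadruple(NamedTuple):
    """One leaderboard entry; fields mirror (Task, Dataset, Metric, Score)."""
    task: str
    dataset: str
    metric: str
    score: str


def exact_match_f1(predicted: set[Quadruple], gold: set[Quadruple]) -> float:
    """Set-based F1 over exact quadruple matches (illustrative, not the paper's metric)."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Invented sample data, for illustration only.
gold = {
    Quadruple("Question Answering", "SQuAD", "F1", "90.0"),
    Quadruple("Question Answering", "SQuAD", "EM", "85.0"),
}
predicted = {
    Quadruple("Question Answering", "SQuAD", "F1", "90.0"),
    Quadruple("Question Answering", "SQuAD", "EM", "80.0"),  # wrong score
}
score = exact_match_f1(predicted, gold)
print(f"exact-match F1: {score:.2f}")
```

Exact matching penalizes a quadruple whose score differs by even one digit, so a relaxed variant that compares only (Task, Dataset, Metric) may be a useful companion measure.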

LLMs4OM: Matching Ontologies with Large Language Models

Apr 16, 2024

Hamed Babaei Giglou, Jennifer D'Souza, Sören Auer

Abstract: Ontology Matching (OM) is a critical task in knowledge integration, where aligning heterogeneous ontologies facilitates data interoperability and knowledge sharing. Traditional OM systems often rely on expert knowledge or predictive models, with limited exploration of the potential of Large Language Models (LLMs). We present the LLMs4OM framework, a novel approach to evaluate the effectiveness of LLMs in OM tasks. This framework utilizes two modules for retrieval and matching, respectively, enhanced by zero-shot prompting across three ontology representations: concept, concept-parent, and concept-children. Through comprehensive evaluations using 20 OM datasets from various domains, we demonstrate that LLMs, under the LLMs4OM framework, can match and even surpass the performance of traditional OM systems, particularly in complex matching scenarios. Our results highlight the potential of LLMs to significantly contribute to the field of OM.

* 8 pages, 1 figure, accepted to ESWC 2024 Special Track on LLMs for Knowledge Engineering (https://2024.eswc-conferences.org/call-for-papers-llms/)
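
The three ontology representations named in the abstract (concept, concept-parent, concept-children) can be pictured as three ways of rendering a concept into a zero-shot prompt. The sketch below is an assumption-laden illustration: the `Concept` class, the `describe` and `build_prompt` functions, and the prompt wording are hypothetical and are not taken from the LLMs4OM framework.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """A hypothetical ontology concept with its direct neighbours."""
    label: str
    parents: list[str] = field(default_factory=list)
    children: list[str] = field(default_factory=list)


def describe(concept: Concept, representation: str) -> str:
    """Render a concept under one of the three representations."""
    if representation == "concept":
        return concept.label
    if representation == "concept-parent":
        parents = ", ".join(concept.parents) or "none"
        return f"{concept.label} (parents: {parents})"
    if representation == "concept-children":
        children = ", ".join(concept.children) or "none"
        return f"{concept.label} (children: {children})"
    raise ValueError(f"unknown representation: {representation}")


def build_prompt(source: Concept, target: Concept, representation: str) -> str:
    """Build a zero-shot matching prompt; the wording is illustrative only."""
    return (
        "Do these two ontology concepts refer to the same thing? "
        "Answer yes or no.\n"
        f"Source: {describe(source, representation)}\n"
        f"Target: {describe(target, representation)}\n"
        "Answer:"
    )


# Invented sample concepts, for illustration only.
heart_a = Concept("heart", parents=["organ"], children=["left ventricle"])
heart_b = Concept("cardiac organ", parents=["body part"])
prompt = build_prompt(heart_a, heart_b, "concept-parent")
print(prompt)
```

Sending the prompt to a model and parsing its answer is left out on purpose, since the abstract does not specify which LLM interfaces the framework uses.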

Toward FAIR Semantic Publishing of Research Dataset Metadata in the Open Research Knowledge Graph

Apr 12, 2024

Raia Abu Ahmad, Jennifer D'Souza, Matthäus Zloch, Wolfgang Otto, Georg Rehm, Allard Oelen, Stefan Dietze, Sören Auer

Abstract: Search engines these days can serve datasets as search results. Datasets get picked up by search technologies based on structured descriptions on their official web pages, informed by metadata ontologies such as the Dataset content type of schema.org. Despite this promotion of the content type dataset as a first-class citizen of search results, a vast proportion of datasets, particularly research datasets, still need to be made discoverable and, therefore, largely remain unused. This is due to the sheer volume of datasets released every day and the inability of metadata to reflect a dataset's content and context accurately. This work seeks to improve this situation for a specific class of datasets, namely research datasets, which are the result of research endeavors and are accompanied by a scholarly publication. We propose the ORKG-Dataset content type, a specialized branch of the Open Research Knowledge Graph (ORKG) platform, which provides descriptive information and a semantic model for research datasets, integrating them with their accompanying scholarly publications. This work aims to establish a standardized framework for recording and reporting research datasets within the ORKG-Dataset content type. This, in turn, increases research dataset transparency on the web for their improved discoverability and applied use. In this paper, we present a proposal -- the minimum FAIR, comparable, semantic description of research datasets in terms of salient properties of their supporting publication. We design a specific application of the ORKG-Dataset semantic model based on 40 diverse research datasets on scientific information extraction.

* In Joint Proceedings of the Onto4FAIR 2023 Workshops: Collocated with FOIS 2023 and SEMANTICS 2023. pp.23-31. https://hal.science/hal-04312604
* 8 pages, 1 figure, published in the Joint Proceedings of the Onto4FAIR 2023 Workshops

Large Language Models for Scientific Information Extraction: An Empirical Study for Virology

Jan 18, 2024

Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer

Abstract: In this paper, we champion the use of structured and semantic content representation of discourse-based scholarly communication, inspired by tools like Wikipedia infoboxes or structured Amazon product descriptions. These representations provide users with a concise overview, aiding scientists in navigating the dense academic landscape. Our novel automated approach leverages the robust text generation capabilities of LLMs to produce structured scholarly contribution summaries, offering both a practical solution and insights into LLMs' emergent abilities. For LLMs, the prime focus is on improving their general intelligence as conversational agents. We argue that these models can also be applied effectively in information extraction (IE), specifically in complex IE tasks within terse domains like Science. This paradigm shift replaces the traditional modular, pipelined machine learning approach with a simpler objective expressed through instructions. Our results show that finetuned FLAN-T5 with 1000x fewer parameters than the state-of-the-art GPT-davinci is competitive for the task.

* 8 pages, 6 figures, Accepted as Findings of the ACL: EACL 2024

LLMs4OL: Large Language Models for Ontology Learning

Aug 02, 2023

Hamed Babaei Giglou, Jennifer D'Souza, Sören Auer

Abstract: We propose the LLMs4OL approach, which utilizes Large Language Models (LLMs) for Ontology Learning (OL). LLMs have shown significant advancements in natural language processing, demonstrating their ability to capture complex language patterns in different knowledge domains. Our LLMs4OL paradigm investigates the following hypothesis: Can LLMs effectively apply their language pattern capturing capability to OL, which involves automatically extracting and structuring knowledge from natural language text? To test this hypothesis, we conduct a comprehensive evaluation using the zero-shot prompting method. We evaluate nine different LLM model families for three main OL tasks: term typing, taxonomy discovery, and extraction of non-taxonomic relations. Additionally, the evaluations encompass diverse genres of ontological knowledge, including lexicosemantic knowledge in WordNet, geographical knowledge in GeoNames, and medical knowledge in UMLS.

* 15 pages main content, 27 pages overall, 2 Figures, accepted for publication at ISWC 2023 research track

Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph

May 22, 2023

Jennifer D'Souza, Moussab Hrou, Sören Auer

Abstract: There have been many recent investigations into prompt-based training of transformer language models for new text genres in low-resource settings. The prompt-based training approach has been found to be effective in generalizing pre-trained or fine-tuned models for transfer to resource-scarce settings. This work, for the first time, reports results on adopting prompt-based training of transformers for scholarly knowledge graph object prediction. The work is unique in the following two main aspects. 1) It deviates from the other works proposing entity and relation extraction pipelines for predicting objects of a scholarly knowledge graph. 2) While other works have tested the method on text genres relatively close to the general knowledge domain, we test the method for a significantly different domain, i.e. scholarly knowledge, in turn testing the linguistic, probabilistic, and factual generalizability of these large-scale transformer models. We find that (i) per expectations, transformer models when tested out-of-the-box underperform on a new domain of data, (ii) prompt-based training of the models achieves performance boosts of up to 40% in a relaxed evaluation setting, and (iii) testing the models on a starkly different domain, even with a clever training objective in a low-resource setting, makes evident the domain knowledge capture gap, offering an empirically verified incentive for investing more attention and resources in the scholarly domain in the context of transformer models.

* 14 pages, 1 figure, accepted for publication as a short paper at DEXA 2023 (https://www.dexa.org/dexa2023)

ORKG-Leaderboards: A Systematic Workflow for Mining Leaderboards as a Knowledge Graph

May 10, 2023

Salomon Kabongo, Jennifer D'Souza, Sören Auer

Abstract: The purpose of this work is to describe the ORKG-Leaderboards software designed to extract leaderboards, defined as Task-Dataset-Metric tuples, automatically from large collections of empirical research papers in Artificial Intelligence (AI). The software can support both the main workflows of scholarly publishing, viz. as LaTeX files or as PDF files. Furthermore, the system is integrated with the Open Research Knowledge Graph (ORKG) platform, which fosters the machine-actionable publishing of scholarly findings. Thus the system output, when integrated within the ORKG's supported Semantic Web infrastructure of representing machine-actionable 'resources' on the Web, enables: 1) broadly, the integration of empirical results of researchers across the world, thus enabling transparency in empirical research with the potential to also be complete, contingent on the underlying data source(s) of publications; and 2) specifically, the tracking of progress in AI by researchers, with an overview of the state-of-the-art (SOTA) across the most common AI tasks and their corresponding datasets via dynamic ORKG frontend views leveraging tables and visualization charts over the machine-actionable data. Our best model achieves performances above 90% F1 on the leaderboard extraction task, thus proving ORKG-Leaderboards a practically viable tool for real-world usage. Going forward, in a sense, ORKG-Leaderboards transforms the leaderboard extraction task to an automated digitalization task, which has been, for a long time in the community, a crowdsourced endeavor.

* NA. arXiv admin note: text overlap with arXiv:2109.13089

Evaluating BERT-based Scientific Relation Classifiers for Scholarly Knowledge Graph Construction on Digital Library Collections

May 03, 2023

Ming Jiang, Jennifer D'Souza, Sören Auer, J. Stephen Downie

Abstract: The rapid growth of research publications has placed great demands on digital libraries (DL) for advanced information management technologies. To cater to these demands, techniques relying on knowledge-graph structures are being advocated. In such graph-based pipelines, inferring semantic relations between related scientific concepts is a crucial step. Recently, BERT-based pre-trained models have been popularly explored for automatic relation classification. Despite significant progress, most of them were evaluated in different scenarios, which limits their comparability. Furthermore, existing methods are primarily evaluated on clean texts, which ignores the digitization context of early scholarly publications in terms of machine scanning and optical character recognition (OCR). In such cases, the texts may contain OCR noise, in turn creating uncertainty about existing classifiers' performances. To address these limitations, we started by creating OCR-noisy texts based on three clean corpora. Given these parallel corpora, we conducted a thorough empirical evaluation of eight BERT-based classification models by focusing on three factors: (1) BERT variants; (2) classification strategies; and (3) OCR noise impacts. Experiments on clean data show that the domain-specific pre-trained BERT is the best variant to identify scientific relations. In general, the strategy of predicting a single relation each time outperforms the one simultaneously identifying multiple relations. The optimal classifier's performance can decline by around 10% to 20% in F-score on the noisy corpora. Insights discussed in this study can help DL stakeholders select techniques for building optimal knowledge-graph-based systems.

* International Journal on Digital Libraries (2022)

Zero-shot Entailment of Leaderboards for Empirical AI Research

Mar 29, 2023

Salomon Kabongo, Jennifer D'Souza, Sören Auer

Abstract: We present a large-scale empirical investigation of the zero-shot learning phenomenon in a specific recognizing textual entailment (RTE) task category, i.e. the automated mining of leaderboards for Empirical AI Research. The prior reported state-of-the-art models for leaderboard extraction formulated as an RTE task, in a non-zero-shot setting, are promising with above 90% reported performances. However, a central research question remains unexamined: did the models actually learn entailment? Thus, for the experiments in this paper, two prior reported state-of-the-art models are tested out-of-the-box for their ability to generalize or their capacity for entailment, given leaderboard labels that were unseen during training. We hypothesize that if the models learned entailment, their zero-shot performances can be expected to be moderately high as well--perhaps, concretely, better than chance. As a result of this work, a zero-shot labeled dataset is created via distant labeling formulating the leaderboard extraction RTE task.

* 5 pages, 1 figure. Accepted for publication at JCDL 2023 - Late Breaking Results and Datasets track (https://2023.jcdl.org/calls/papers/#paper_types), official citation forthcoming

Describing and Organizing Semantic Web and Machine Learning Systems in the SWeMLS-KG

Mar 27, 2023

Fajar J. Ekaputra, Majlinda Llugiqi, Marta Sabou, Andreas Ekelhart, Heiko Paulheim, Anna Breit, Artem Revenko, Laura Waltersdorfer, Kheir Eddine Farfar, Sören Auer

Abstract: In line with the general trend in artificial intelligence research to create intelligent systems that combine learning and symbolic components, a new sub-area has emerged that focuses on combining machine learning (ML) components with techniques developed by the Semantic Web (SW) community - Semantic Web Machine Learning (SWeML for short). Due to its rapid growth and impact on several communities in the last two decades, there is a need to better understand the space of these SWeML Systems, their characteristics, and trends. Yet, surveys that adopt principled and unbiased approaches are missing. To fill this gap, we performed a systematic study and analyzed nearly 500 papers published in the last decade in this area, where we focused on evaluating architectural and application-specific features. Our analysis identified a rapidly growing interest in SWeML Systems, with a high impact on several application domains and tasks. Catalysts for this rapid growth are the increased application of deep learning and knowledge graph technologies. By leveraging the in-depth understanding of this area acquired through this study, a further key contribution of this paper is a classification system for SWeML Systems which we publish as an ontology.

* Preprint of a paper in the resource track of the 20th Extended Semantic Web Conference (ESWC'23)
