Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Timo Pierre Schrader

QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios

Oct 14, 2024

Timo Pierre Schrader, Lukas Lange, Simon Razniewski, Annemarie Friedrich

Figure 1 for QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios

Figure 2 for QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios

Figure 3 for QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios

Figure 4 for QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios

Abstract:Reasoning is key to many decision making processes. It requires consolidating a set of rule-like premises that are often associated with degrees of uncertainty and observations to draw conclusions. In this work, we address both the case where premises are specified as numeric probabilistic rules and situations in which humans state their estimates using words expressing degrees of certainty. Existing probabilistic reasoning datasets simplify the task, e.g., by requiring the model to only rank textual alternatives, by including only binary random variables, or by making use of a limited set of templates that result in less varied text. In this work, we present QUITE, a question answering dataset of real-world Bayesian reasoning scenarios with categorical random variables and complex relationships. QUITE provides high-quality natural language verbalizations of premises together with evidence statements and expects the answer to a question in the form of an estimated probability. We conduct an extensive set of experiments, finding that logic-based models outperform out-of-the-box large language models on all reasoning types (causal, evidential, and explaining-away). Our results provide evidence that neuro-symbolic models are a promising direction for improving complex reasoning. We release QUITE and code for training and experiments on Github.

* accepted at EMNLP 2024 (main)

Via

Access Paper or Ask Questions

BoschAI @ Causal News Corpus 2023: Robust Cause-Effect Span Extraction using Multi-Layer Sequence Tagging and Data Augmentation

Dec 11, 2023

Timo Pierre Schrader, Simon Razniewski, Lukas Lange, Annemarie Friedrich

Abstract:Understanding causality is a core aspect of intelligence. The Event Causality Identification with Causal News Corpus Shared Task addresses two aspects of this challenge: Subtask 1 aims at detecting causal relationships in texts, and Subtask 2 requires identifying signal words and the spans that refer to the cause or effect, respectively. Our system, which is based on pre-trained transformers, stacked sequence tagging, and synthetic data augmentation, ranks third in Subtask 1 and wins Subtask 2 with an F1 score of 72.8, corresponding to a margin of 13 pp. to the second-best system.

* 6 pages, 6 tables, 1 figure, published in "Proceedings of the 6th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text"

Via

Access Paper or Ask Questions

MuLMS: A Multi-Layer Annotated Text Corpus for Information Extraction in the Materials Science Domain

Oct 24, 2023

Timo Pierre Schrader, Matteo Finco, Stefan Grünewald, Felix Hildebrand, Annemarie Friedrich

Figure 1 for MuLMS: A Multi-Layer Annotated Text Corpus for Information Extraction in the Materials Science Domain

Figure 2 for MuLMS: A Multi-Layer Annotated Text Corpus for Information Extraction in the Materials Science Domain

Figure 3 for MuLMS: A Multi-Layer Annotated Text Corpus for Information Extraction in the Materials Science Domain

Figure 4 for MuLMS: A Multi-Layer Annotated Text Corpus for Information Extraction in the Materials Science Domain

Abstract:Keeping track of all relevant recent publications and experimental results for a research area is a challenging task. Prior work has demonstrated the efficacy of information extraction models in various scientific areas. Recently, several datasets have been released for the yet understudied materials science domain. However, these datasets focus on sub-problems such as parsing synthesis procedures or on sub-domains, e.g., solid oxide fuel cells. In this resource paper, we present MuLMS, a new dataset of 50 open-access articles, spanning seven sub-domains of materials science. The corpus has been annotated by domain experts with several layers ranging from named entities over relations to frame structures. We present competitive neural models for all tasks and demonstrate that multi-task training with existing related resources leads to benefits.

* 17 pages, 2 figures, 28 tables, to be published in "Proceedings of the second Workshop on Information Extraction from Scientific Publications"

Via

Access Paper or Ask Questions

MuLMS-AZ: An Argumentative Zoning Dataset for the Materials Science Domain

Jul 05, 2023

Timo Pierre Schrader, Teresa Bürkle, Sophie Henning, Sherry Tan, Matteo Finco, Stefan Grünewald, Maira Indrikova, Felix Hildebrand, Annemarie Friedrich

Figure 1 for MuLMS-AZ: An Argumentative Zoning Dataset for the Materials Science Domain

Figure 2 for MuLMS-AZ: An Argumentative Zoning Dataset for the Materials Science Domain

Figure 3 for MuLMS-AZ: An Argumentative Zoning Dataset for the Materials Science Domain

Figure 4 for MuLMS-AZ: An Argumentative Zoning Dataset for the Materials Science Domain

Abstract:Scientific publications follow conventionalized rhetorical structures. Classifying the Argumentative Zone (AZ), e.g., identifying whether a sentence states a Motivation, a Result or Background information, has been proposed to improve processing of scholarly documents. In this work, we adapt and extend this idea to the domain of materials science research. We present and release a new dataset of 50 manually annotated research articles. The dataset spans seven sub-topics and is annotated with a materials-science focused multi-label annotation scheme for AZ. We detail corpus statistics and demonstrate high inter-annotator agreement. Our computational experiments show that using domain-specific pre-trained transformer-based text encoders is key to high classification performance. We also find that AZ categories from existing datasets in other domains are transferable to varying degrees.

* 15 pages, 2 figures, 14 tables, to be published in "Proceedings of the 4th Workshop on Computational Approaches to Discourse"

Via

Access Paper or Ask Questions