Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Raphaël Troncy

Hallucination or Creativity: How to Evaluate AI-Generated Scientific Stories?

Feb 02, 2026

Alex Argese, Pasquale Lisena, Raphaël Troncy

Abstract:Generative AI can turn scientific articles into narratives for diverse audiences, but evaluating these stories remains challenging. Storytelling demands abstraction, simplification, and pedagogical creativity-qualities that are not often well-captured by standard summarization metrics. Meanwhile, factual hallucinations are critical in scientific contexts, yet, detectors often misclassify legitimate narrative reformulations or prove unstable when creativity is involved. In this work, we propose StoryScore, a composite metric for evaluating AI-generated scientific stories. StoryScore integrates semantic alignment, lexical grounding, narrative control, structural fidelity, redundancy avoidance, and entity-level hallucination detection into a unified framework. Our analysis also reveals why many hallucination detection methods fail to distinguish pedagogical creativity from factual errors, highlighting a key limitation: while automatic metrics can effectively assess semantic similarity with original content, they struggle to evaluate how it is narrated and controlled.

Via

Access Paper or Ask Questions

Multimodal Metadata Assignment for Cultural Heritage Artifacts

Jun 01, 2024

Luis Rei, Dunja Mladenić, Mareike Dorozynski, Franz Rottensteiner, Thomas Schleider, Raphaël Troncy, Jorge Sebastián Lozano, Mar Gaitán Salvatella

Abstract:We develop a multimodal classifier for the cultural heritage domain using a late fusion approach and introduce a novel dataset. The three modalities are Image, Text, and Tabular data. We based the image classifier on a ResNet convolutional neural network architecture and the text classifier on a multilingual transformer architecture (XML-Roberta). Both are trained as multitask classifiers and use the focal loss to handle class imbalance. Tabular data and late fusion are handled by Gradient Tree Boosting. We also show how we leveraged specific data models and taxonomy in a Knowledge Graph to create the dataset and to store classification results. All individual classifiers accurately predict missing properties in the digitized silk artifacts, with the multimodal approach providing the best results.

* Multimedia Systems 29 (2023) 847-869

Via

Access Paper or Ask Questions

Improving (Re-)Usability of Musical Datasets: An Overview of the DOREMUS Project

May 06, 2024

Pasquale Lisena, Manel Achichi, Pierre Choffé, Cécile Cecconi, Konstantin Todorov, Bernard Jacquemin, Raphaël Troncy

Figure 1 for Improving (Re-)Usability of Musical Datasets: An Overview of the DOREMUS Project

Figure 2 for Improving (Re-)Usability of Musical Datasets: An Overview of the DOREMUS Project

Figure 3 for Improving (Re-)Usability of Musical Datasets: An Overview of the DOREMUS Project

Figure 4 for Improving (Re-)Usability of Musical Datasets: An Overview of the DOREMUS Project

Abstract:DOREMUS works on a better description of music by building new tools to link and explore the data of three French institutions. This paper gives an overview of the data model based on FRBRoo, explains the conversion and linking processes using linked data technologies and presents the prototypes created to consume the data according to the web users' needs.

* Bibliothek Forschung und Praxis, 2018, 42 (2), pp.194-205.

Via

Access Paper or Ask Questions

Searching News Articles Using an Event Knowledge Graph Leveraged by Wikidata

Apr 11, 2019

Charlotte Rudnik, Thibault Ehrhart, Olivier Ferret, Denis Teyssou, Raphaël Troncy, Xavier Tannier

Figure 1 for Searching News Articles Using an Event Knowledge Graph Leveraged by Wikidata

Figure 2 for Searching News Articles Using an Event Knowledge Graph Leveraged by Wikidata

Figure 3 for Searching News Articles Using an Event Knowledge Graph Leveraged by Wikidata

Figure 4 for Searching News Articles Using an Event Knowledge Graph Leveraged by Wikidata

Abstract:News agencies produce thousands of multimedia stories describing events happening in the world that are either scheduled such as sports competitions, political summits and elections, or breaking events such as military conflicts, terrorist attacks, natural disasters, etc. When writing up those stories, journalists refer to contextual background and to compare with past similar events. However, searching for precise facts described in stories is hard. In this paper, we propose a general method that leverages the Wikidata knowledge base to produce semantic annotations of news articles. Next, we describe a semantic search engine that supports both keyword based search in news articles and structured data search providing filters for properties belonging to specific event schemas that are automatically inferred.

* WikiWorkshop at The Web Conference 2019

Via

Access Paper or Ask Questions

Analysis of Named Entity Recognition and Linking for Tweets

Oct 27, 2014

Leon Derczynski, Diana Maynard, Giuseppe Rizzo, Marieke van Erp, Genevieve Gorrell, Raphaël Troncy, Johann Petrak, Kalina Bontcheva

Figure 1 for Analysis of Named Entity Recognition and Linking for Tweets

Figure 2 for Analysis of Named Entity Recognition and Linking for Tweets

Figure 3 for Analysis of Named Entity Recognition and Linking for Tweets

Figure 4 for Analysis of Named Entity Recognition and Linking for Tweets

Abstract:Applying natural language processing for mining and intelligent information access to tweets (a form of microblog) is a challenging, emerging research area. Unlike carefully authored news text and other longer content, tweets pose a number of new challenges, due to their short, noisy, context-dependent, and dynamic nature. Information extraction from tweets is typically performed in a pipeline, comprising consecutive stages of language identification, tokenisation, part-of-speech tagging, named entity recognition and entity disambiguation (e.g. with respect to DBpedia). In this work, we describe a new Twitter entity disambiguation dataset, and conduct an empirical analysis of named entity recognition and disambiguation, investigating how robust a number of state-of-the-art systems are on such noisy texts, what the main sources of error are, and which problems should be further investigated to improve the state of the art.

* Information Processing & Management 51 (2), 32-49, 2014
* 35 pages, accepted to journal Information Processing and Management

Via

Access Paper or Ask Questions