Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thomas Bonald

IP Paris

Tailoring Strictly Proper Scoring Rules for Downstream Tasks: An Application to Causal Inference

Jun 02, 2026

Roman Plaud, Alexandre Perez-Lebel, Antoine Saillenfest, Thomas Bonald, Marine Le Morvan, Gaël Varoquaux, Matthieu Labeau

Abstract:Probabilistic models are typically trained using task-agnostic objectives like log-loss, which can lead to significant errors in downstream estimation. This disconnect is especially critical in Inverse Probability Weighting (IPW) for causal inference, where propensity score errors near $0$ and $1$ often lead to high bias and variance. We propose a principled framework for deriving task-specific strictly proper scoring rules by matching the local curvature of the downstream error metric. We apply this to the Average Treatment Effect (ATE) estimation, deriving a closed-form loss and its corresponding canonical probability mapping that can be readily integrated with any model like a neural network or a gradient boosting algorithm. Extensive evaluations on causal inference benchmarks demonstrate that our tailored objective consistently outperforms standard likelihood-based and covariate-balancing approaches.

* Accepted to ICML 2026

Via

Access Paper or Ask Questions

Implicit Regularization of Mini-Batch Training in Graph Neural Networks

May 21, 2026

Clement Wang, Antoine Vialle, Robin Vaysse, Thomas Bonald

Abstract:Mini-batch training of Graph Neural Networks (GNNs) is fundamentally different from training on i.i.d. data: sampling a subgraph alters the topology and introduces boundary effects, leading prior work to develop structure-aware samplers that preserve local connectivity and reduce embedding variance. Surprisingly, we demonstrate that the simplest possible scheme, Random Node Sampling (RNS), training on the induced subgraph of uniformly sampled nodes, matches or outperforms full-graph training on 8 of 10 datasets at a fraction of the wall-clock time and memory. To explain this, we apply backward error analysis to graph mini-batch Stochastic Gradient Descent (SGD) and show that it implicitly minimizes the sampled loss plus a regularizer proportional to the mini-batch gradient variance, a quantity directly shaped by the sampler. Although RNS discards local structure, it produces mini-batches whose expected loss is closer to the full-graph loss, and whose per-batch gradients have lower variance, yielding a better implicit objective. Our analysis reframes the choice of graph sampler as a form of implicit regularization, and identifies RNS as a strong, theoretically grounded method for scalable GNN training.

Via

Access Paper or Ask Questions

FLORA: Unsupervised Knowledge Graph Alignment by Fuzzy Logic

Oct 23, 2025

Yiwen Peng, Thomas Bonald, Fabian M. Suchanek

Abstract:Knowledge graph alignment is the task of matching equivalent entities (that is, instances and classes) and relations across two knowledge graphs. Most existing methods focus on pure entity-level alignment, computing the similarity of entities in some embedding space. They lack interpretable reasoning and need training data to work. In this paper, we propose FLORA, a simple yet effective method that (1) is unsupervised, i.e., does not require training data, (2) provides a holistic alignment for entities and relations iteratively, (3) is based on fuzzy logic and thus delivers interpretable results, (4) provably converges, (5) allows dangling entities, i.e., entities without a counterpart in the other KG, and (6) achieves state-of-the-art results on major benchmarks.

* The 24th International Semantic Web Conference (ISWC), Nov 2025, Nara / Japan, Japan

Via

Access Paper or Ask Questions

Graph as a feature: improving node classification with non-neural graph-aware logistic regression

Nov 19, 2024

Simon Delarue, Thomas Bonald, Tiphaine Viard

Abstract:Graph Neural Networks (GNNs) and their message passing framework that leverages both structural and feature information, have become a standard method for solving graph-based machine learning problems. However, these approaches still struggle to generalise well beyond datasets that exhibit strong homophily, where nodes of the same class tend to connect. This limitation has led to the development of complex neural architectures that pose challenges in terms of efficiency and scalability. In response to these limitations, we focus on simpler and more scalable approaches and introduce Graph-aware Logistic Regression (GLR), a non-neural model designed for node classification tasks. Unlike traditional graph algorithms that use only a fraction of the information accessible to GNNs, our proposed model simultaneously leverages both node features and the relationships between entities. However instead of relying on message passing, our approach encodes each node's relationships as an additional feature vector, which is then combined with the node's self attributes. Extensive experimental results, conducted within a rigorous evaluation framework, show that our proposed GLR approach outperforms both foundational and sophisticated state-of-the-art GNN models in node classification tasks. Going beyond the traditional limited benchmarks, our experiments indicate that GLR increases generalisation ability while reaching performance gains in computation time up to two orders of magnitude compared to it best neural competitor.

Via

Access Paper or Ask Questions

Revisiting Hierarchical Text Classification: Inference and Metrics

Oct 02, 2024

Roman Plaud, Matthieu Labeau, Antoine Saillenfest, Thomas Bonald

Figure 1 for Revisiting Hierarchical Text Classification: Inference and Metrics

Figure 2 for Revisiting Hierarchical Text Classification: Inference and Metrics

Figure 3 for Revisiting Hierarchical Text Classification: Inference and Metrics

Figure 4 for Revisiting Hierarchical Text Classification: Inference and Metrics

Abstract:Hierarchical text classification (HTC) is the task of assigning labels to a text within a structured space organized as a hierarchy. Recent works treat HTC as a conventional multilabel classification problem, therefore evaluating it as such. We instead propose to evaluate models based on specifically designed hierarchical metrics and we demonstrate the intricacy of metric choice and prediction inference method. We introduce a new challenging dataset and we evaluate fairly, recent sophisticated models, comparing them with a range of simple but strong baselines, including a new theoretically motivated loss. Finally, we show that those baselines are very often competitive with the latest models. This highlights the importance of carefully considering the evaluation methodology when proposing new methods for HTC. Code implementation and dataset are available at \url{https://github.com/RomanPlaud/revisitingHTC}.

* Accepted at CoNLL 2024

Via

Access Paper or Ask Questions

The Factuality of Large Language Models in the Legal Domain

Sep 18, 2024

Rajaa El Hamdani, Thomas Bonald, Fragkiskos Malliaros, Nils Holzenberger, Fabian Suchanek

Figure 1 for The Factuality of Large Language Models in the Legal Domain

Figure 2 for The Factuality of Large Language Models in the Legal Domain

Figure 3 for The Factuality of Large Language Models in the Legal Domain

Figure 4 for The Factuality of Large Language Models in the Legal Domain

Abstract:This paper investigates the factuality of large language models (LLMs) as knowledge bases in the legal domain, in a realistic usage scenario: we allow for acceptable variations in the answer, and let the model abstain from answering when uncertain. First, we design a dataset of diverse factual questions about case law and legislation. We then use the dataset to evaluate several LLMs under different evaluation methods, including exact, alias, and fuzzy matching. Our results show that the performance improves significantly under the alias and fuzzy matching methods. Further, we explore the impact of abstaining and in-context examples, finding that both strategies enhance precision. Finally, we demonstrate that additional pre-training on legal documents, as seen with SaulLM, further improves factual precision from 63% to 81%.

* CIKM 2024, short paper

Via

Access Paper or Ask Questions

Refining Wikidata Taxonomy using Large Language Models

Sep 06, 2024

Yiwen Peng, Thomas Bonald, Mehwish Alam

Figure 1 for Refining Wikidata Taxonomy using Large Language Models

Figure 2 for Refining Wikidata Taxonomy using Large Language Models

Figure 3 for Refining Wikidata Taxonomy using Large Language Models

Abstract:Due to its collaborative nature, Wikidata is known to have a complex taxonomy, with recurrent issues like the ambiguity between instances and classes, the inaccuracy of some taxonomic paths, the presence of cycles, and the high level of redundancy across classes. Manual efforts to clean up this taxonomy are time-consuming and prone to errors or subjective decisions. We present WiKC, a new version of Wikidata taxonomy cleaned automatically using a combination of Large Language Models (LLMs) and graph mining techniques. Operations on the taxonomy, such as cutting links or merging classes, are performed with the help of zero-shot prompting on an open-source LLM. The quality of the refined taxonomy is evaluated from both intrinsic and extrinsic perspectives, on a task of entity typing for the latter, showing the practical interest of WiKC.

* ACM International Conference on Information and Knowledge Management, Oct 2024, Boise, Idaho, United States

Via

Access Paper or Ask Questions

A Consistent Diffusion-Based Algorithm for Semi-Supervised Graph Learning

Nov 13, 2023

Thomas Bonald, Nathan de Lara

Figure 1 for A Consistent Diffusion-Based Algorithm for Semi-Supervised Graph Learning

Figure 2 for A Consistent Diffusion-Based Algorithm for Semi-Supervised Graph Learning

Figure 3 for A Consistent Diffusion-Based Algorithm for Semi-Supervised Graph Learning

Figure 4 for A Consistent Diffusion-Based Algorithm for Semi-Supervised Graph Learning

Abstract:The task of semi-supervised classification aims at assigning labels to all nodes of a graph based on the labels known for a few nodes, called the seeds. One of the most popular algorithms relies on the principle of heat diffusion, where the labels of the seeds are spread by thermoconductance and the temperature of each node at equilibrium is used as a score function for each label. In this paper, we prove that this algorithm is not consistent unless the temperatures of the nodes at equilibrium are centered before scoring. This crucial step does not only make the algorithm provably consistent on a block model but brings significant performance gains on real graphs.

* Complex Networks, 2023, Menton, France
* arXiv admin note: substantial text overlap with arXiv:2008.11944

Via

Access Paper or Ask Questions

Integrating the Wikidata Taxonomy into YAGO

Aug 23, 2023

Fabian Suchanek, Mehwish Alam, Thomas Bonald, Pierre-Henri Paris, Jules Soria

Figure 1 for Integrating the Wikidata Taxonomy into YAGO

Figure 2 for Integrating the Wikidata Taxonomy into YAGO

Figure 3 for Integrating the Wikidata Taxonomy into YAGO

Abstract:Wikidata is one of the largest public general-purpose Knowledge Bases (KBs). Yet, due to its collaborative nature, its schema and taxonomy have become convoluted. For the YAGO 4 KB, we combined Wikidata with the ontology from Schema.org, which reduced and cleaned up the taxonomy and constraints and made it possible to run automated reasoners on the data. However, it also cut away large parts of the Wikidata taxonomy. In this paper, we present our effort to merge the entire Wikidata taxonomy into the YAGO KB as much as possible. We pay particular attention to logical constraints and a careful distinction of classes and instances. Our work creates YAGO 4.5, which adds a rich layer of informative classes to YAGO, while at the same time keeping the KB logically consistent.

Via

Access Paper or Ask Questions

A Self-Encoder for Learning Nearest Neighbors

Jun 25, 2023

Armand Boschin, Thomas Bonald, Marc Jeanmougin

Figure 1 for A Self-Encoder for Learning Nearest Neighbors

Figure 2 for A Self-Encoder for Learning Nearest Neighbors

Figure 3 for A Self-Encoder for Learning Nearest Neighbors

Figure 4 for A Self-Encoder for Learning Nearest Neighbors

Abstract:We present the self-encoder, a neural network trained to guess the identity of each data sample. Despite its simplicity, it learns a very useful representation of data, in a self-supervised way. Specifically, the self-encoder learns to distribute the data samples in the embedding space so that they are linearly separable from one another. This induces a geometry where two samples are close in the embedding space when they are not easy to differentiate. The self-encoder can then be combined with a nearest-neighbor classifier or regressor for any subsequent supervised task. Unlike regular nearest neighbors, the predictions resulting from this encoding of data are invariant to any scaling of features, making any preprocessing like min-max scaling not necessary. The experiments show the efficiency of the approach, especially on heterogeneous data mixing numerical features and categorical features.

Via

Access Paper or Ask Questions