Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yuichi Ike

LIGM

Persistence-based topological optimization: a survey

Mar 24, 2026

Mathieu Carriere, Yuichi Ike, Théo Lacombe, Naoki Nishikawa

Abstract:Computational topology provides a tool, persistent homology, to extract quantitative descriptors from structured objects (images, graphs, point clouds, etc). These descriptors can then be involved in optimization problems, typically as a way to incorporate topological priors or to regularize machine learning models. This is usually achieved by minimizing adequate, topologically-informed losses based on these descriptors, which, in turn, naturally raises theoretical and practical questions about the possibility of optimizing such loss functions using gradient-based algorithms. This has been an active research field in the topological data analysis community over the last decade, and various techniques have been developed to enable optimization of persistence-based loss functions with gradient descent schemes. This survey presents the current state of this field, covering its theoretical foundations, the algorithmic aspects, and showcasing practical uses in several applications. It includes a detailed introduction to persistence theory and, as such, aims at being accessible to mathematicians and data scientists newcomers to the field. It is accompanied by an open-source library which implements the different approaches covered in this survey, providing a convenient playground for researchers to get familiar with the field.

Via

Access Paper or Ask Questions

Learning Tangent Bundles and Characteristic Classes with Autoencoder Atlases

Feb 26, 2026

Eduardo Paluzo-Hidalgo, Yuichi Ike

Abstract:We introduce a theoretical framework that connects multi-chart autoencoders in manifold learning with the classical theory of vector bundles and characteristic classes. Rather than viewing autoencoders as producing a single global Euclidean embedding, we treat a collection of locally trained encoder-decoder pairs as a learned atlas on a manifold. We show that any reconstruction-consistent autoencoder atlas canonically defines transition maps satisfying the cocycle condition, and that linearising these transition maps yields a vector bundle coinciding with the tangent bundle when the latent dimension matches the intrinsic dimension of the manifold. This construction provides direct access to differential-topological invariants of the data. In particular, we show that the first Stiefel-Whitney class can be computed from the signs of the Jacobians of learned transition maps, yielding an algorithmic criterion for detecting orientability. We also show that non-trivial characteristic classes provide obstructions to single-chart representations, and that the minimum number of autoencoder charts is determined by the good cover structure of the manifold. Finally, we apply our methodology to low-dimensional orientable and non-orientable manifolds, as well as to a non-orientable high-dimensional image dataset.

Via

Access Paper or Ask Questions

Learning Decision Trees and Forests with Algorithmic Recourse

Jun 03, 2024

Kentaro Kanamori, Takuya Takagi, Ken Kobayashi, Yuichi Ike

Figure 1 for Learning Decision Trees and Forests with Algorithmic Recourse

Figure 2 for Learning Decision Trees and Forests with Algorithmic Recourse

Figure 3 for Learning Decision Trees and Forests with Algorithmic Recourse

Figure 4 for Learning Decision Trees and Forests with Algorithmic Recourse

Abstract:This paper proposes a new algorithm for learning accurate tree-based models while ensuring the existence of recourse actions. Algorithmic Recourse (AR) aims to provide a recourse action for altering the undesired prediction result given by a model. Typical AR methods provide a reasonable action by solving an optimization task of minimizing the required effort among executable actions. In practice, however, such actions do not always exist for models optimized only for predictive performance. To alleviate this issue, we formulate the task of learning an accurate classification tree under the constraint of ensuring the existence of reasonable actions for as many instances as possible. Then, we propose an efficient top-down greedy algorithm by leveraging the adversarial training techniques. We also show that our proposed algorithm can be applied to the random forest, which is known as a popular framework for learning tree ensembles. Experimental results demonstrated that our method successfully provided reasonable actions to more instances than the baselines without significantly degrading accuracy and computational efficiency.

* 27 pages, 10 figures, to appear in the 41st International Conference on Machine Learning (ICML 2024)

Via

Access Paper or Ask Questions

Adaptive Topological Feature via Persistent Homology: Filtration Learning for Point Clouds

Jul 18, 2023

Naoki Nishikawa, Yuichi Ike, Kenji Yamanishi

Figure 1 for Adaptive Topological Feature via Persistent Homology: Filtration Learning for Point Clouds

Figure 2 for Adaptive Topological Feature via Persistent Homology: Filtration Learning for Point Clouds

Figure 3 for Adaptive Topological Feature via Persistent Homology: Filtration Learning for Point Clouds

Figure 4 for Adaptive Topological Feature via Persistent Homology: Filtration Learning for Point Clouds

Abstract:Machine learning for point clouds has been attracting much attention, with many applications in various fields, such as shape recognition and material science. To enhance the accuracy of such machine learning methods, it is known to be effective to incorporate global topological features, which are typically extracted by persistent homology. In the calculation of persistent homology for a point cloud, we need to choose a filtration for the point clouds, an increasing sequence of spaces. Because the performance of machine learning methods combined with persistent homology is highly affected by the choice of a filtration, we need to tune it depending on data and tasks. In this paper, we propose a framework that learns a filtration adaptively with the use of neural networks. In order to make the resulting persistent homology isometry-invariant, we develop a neural network architecture with such invariance. Additionally, we theoretically show a finite-dimensional approximation result that justifies our architecture. Experimental results demonstrated the efficacy of our framework in several classification tasks.

* 17 pages with 4 figures

Via

Access Paper or Ask Questions

MAGDiff: Covariate Data Set Shift Detection via Activation Graphs of Deep Neural Networks

May 22, 2023

Felix Hensel, Charles Arnal, Mathieu Carrière, Théo Lacombe, Hiroaki Kurihara, Yuichi Ike, Frédéric Chazal

Abstract:Despite their successful application to a variety of tasks, neural networks remain limited, like other machine learning methods, by their sensitivity to shifts in the data: their performance can be severely impacted by differences in distribution between the data on which they were trained and that on which they are deployed. In this article, we propose a new family of representations, called MAGDiff, that we extract from any given neural network classifier and that allows for efficient covariate data shift detection without the need to train a new model dedicated to this task. These representations are computed by comparing the activation graphs of the neural network for samples belonging to the training distribution and to the target distribution, and yield powerful data- and task-adapted statistics for the two-sample tests commonly used for data set shift detection. We demonstrate this empirically by measuring the statistical powers of two-sample Kolmogorov-Smirnov (KS) tests on several different data sets and shift types, and showing that our novel representations induce significant improvements over a state-of-the-art baseline relying on the network output.

Via

Access Paper or Ask Questions

Counterfactual Explanation with Missing Values

Apr 28, 2023

Kentaro Kanamori, Takuya Takagi, Ken Kobayashi, Yuichi Ike

Figure 1 for Counterfactual Explanation with Missing Values

Figure 2 for Counterfactual Explanation with Missing Values

Figure 3 for Counterfactual Explanation with Missing Values

Figure 4 for Counterfactual Explanation with Missing Values

Abstract:Counterfactual Explanation (CE) is a post-hoc explanation method that provides a perturbation for altering the prediction result of a classifier. Users can interpret the perturbation as an "action" to obtain their desired decision results. Existing CE methods require complete information on the features of an input instance. However, we often encounter missing values in a given instance, and the previous methods do not work in such a practical situation. In this paper, we first empirically and theoretically show the risk that missing value imputation methods affect the validity of an action, as well as the features that the action suggests changing. Then, we propose a new framework of CE, named Counterfactual Explanation by Pairs of Imputation and Action (CEPIA), that enables users to obtain valid actions even with missing values and clarifies how actions are affected by imputation of the missing values. Specifically, our CEPIA provides a representative set of pairs of an imputation candidate for a given incomplete instance and its optimal action. We formulate the problem of finding such a set as a submodular maximization problem, which can be solved by a simple greedy algorithm with an approximation guarantee. Experimental results demonstrated the efficacy of our CEPIA in comparison with the baselines in the presence of missing values.

* 31 pages, 12 figures

Via

Access Paper or Ask Questions

Vanishing Component Analysis with Contrastive Normalization

Oct 27, 2022

Ryosuke Masuya, Yuichi Ike, Hiroshi Kera

Abstract:Vanishing component analysis (VCA) computes approximate generators of vanishing ideals of samples, which are further used for extracting nonlinear features of the samples. Recent studies have shown that normalization of approximate generators plays an important role and different normalization leads to generators of different properties. In this paper, inspired by recent self-supervised frameworks, we propose a contrastive normalization method for VCA, where we impose the generators to vanish on the target samples and to be normalized on the transformed samples. We theoretically show that a contrastive normalization enhances the discriminative power of VCA, and provide the algebraic interpretation of VCA under our normalization. Numerical experiments demonstrate the effectiveness of our method. This is the first study to tailor the normalization of approximate generators of vanishing ideals to obtain discriminative features.

* 22pages, 1 figure

Via

Access Paper or Ask Questions

RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

Feb 04, 2022

Thibault de Surrel, Felix Hensel, Mathieu Carrière, Théo Lacombe, Yuichi Ike, Hiroaki Kurihara, Marc Glisse, Frédéric Chazal

Figure 1 for RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

Figure 2 for RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

Figure 3 for RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

Figure 4 for RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

Abstract:The use of topological descriptors in modern machine learning applications, such as Persistence Diagrams (PDs) arising from Topological Data Analysis (TDA), has shown great potential in various domains. However, their practical use in applications is often hindered by two major limitations: the computational complexity required to compute such descriptors exactly, and their sensitivity to even low-level proportions of outliers. In this work, we propose to bypass these two burdens in a data-driven setting by entrusting the estimation of (vectorization of) PDs built on top of point clouds to a neural network architecture that we call RipsNet. Once trained on a given data set, RipsNet can estimate topological descriptors on test data very efficiently with generalization capacity. Furthermore, we prove that RipsNet is robust to input perturbations in terms of the 1-Wasserstein distance, a major improvement over the standard computation of PDs that only enjoys Hausdorff stability, yielding RipsNet to substantially outperform exactly-computed PDs in noisy settings. We showcase the use of RipsNet on both synthetic and real-world data. Our open-source implementation is publicly available at https://github.com/hensel-f/ripsnet and will be included in the Gudhi library.

* 23 pages, 4 figures

Via

Access Paper or Ask Questions

Topological Uncertainty: Monitoring trained neural networks through persistence of activation graphs

May 07, 2021

Théo Lacombe, Yuichi Ike, Mathieu Carriere, Frédéric Chazal, Marc Glisse, Yuhei Umeda

Figure 1 for Topological Uncertainty: Monitoring trained neural networks through persistence of activation graphs

Figure 2 for Topological Uncertainty: Monitoring trained neural networks through persistence of activation graphs

Figure 3 for Topological Uncertainty: Monitoring trained neural networks through persistence of activation graphs

Figure 4 for Topological Uncertainty: Monitoring trained neural networks through persistence of activation graphs

Abstract:Although neural networks are capable of reaching astonishing performances on a wide variety of contexts, properly training networks on complicated tasks requires expertise and can be expensive from a computational perspective. In industrial applications, data coming from an open-world setting might widely differ from the benchmark datasets on which a network was trained. Being able to monitor the presence of such variations without retraining the network is of crucial importance. In this article, we develop a method to monitor trained neural networks based on the topological properties of their activation graphs. To each new observation, we assign a Topological Uncertainty, a score that aims to assess the reliability of the predictions by investigating the whole network instead of its final layer only, as typically done by practitioners. Our approach entirely works at a post-training level and does not require any assumption on the network architecture, optimization scheme, nor the use of data augmentation or auxiliary datasets; and can be faithfully applied on a large range of network architectures and data types. We showcase experimentally the potential of Topological Uncertainty in the context of trained network selection, Out-Of-Distribution detection, and shift-detection, both on synthetic and real datasets of images and graphs.

* 2021 International Joint Conference on Artificial Intelligence, Aug 2021, Montr{\'e}al, Canada

Via

Access Paper or Ask Questions

Ordered Counterfactual Explanation by Mixed-Integer Linear Optimization

Dec 22, 2020

Kentaro Kanamori, Takuya Takagi, Ken Kobayashi, Yuichi Ike, Kento Uemura, Hiroki Arimura

Figure 1 for Ordered Counterfactual Explanation by Mixed-Integer Linear Optimization

Figure 2 for Ordered Counterfactual Explanation by Mixed-Integer Linear Optimization

Figure 3 for Ordered Counterfactual Explanation by Mixed-Integer Linear Optimization

Figure 4 for Ordered Counterfactual Explanation by Mixed-Integer Linear Optimization

Abstract:Post-hoc explanation methods for machine learning models have been widely used to support decision-making. One of the popular methods is Counterfactual Explanation (CE), which provides a user with a perturbation vector of features that alters the prediction result. Given a perturbation vector, a user can interpret it as an "action" for obtaining one's desired decision result. In practice, however, showing only a perturbation vector is often insufficient for users to execute the action. The reason is that if there is an asymmetric interaction among features, such as causality, the total cost of the action is expected to depend on the order of changing features. Therefore, practical CE methods are required to provide an appropriate order of changing features in addition to a perturbation vector. For this purpose, we propose a new framework called Ordered Counterfactual Explanation (OrdCE). We introduce a new objective function that evaluates a pair of an action and an order based on feature interaction. To extract an optimal pair, we propose a mixed-integer linear optimization approach with our objective function. Numerical experiments on real datasets demonstrated the effectiveness of our OrdCE in comparison with unordered CE methods.

* 20 pages, 5 figures, to appear in the 35th AAAI Conference on Artificial Intelligence (#AAAI2021)

Via

Access Paper or Ask Questions