Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sławomir T. Wierzchoń

Rough Sets for Explainability of Spectral Graph Clustering

Dec 13, 2025

Bartłomiej Starosta, Sławomir T. Wierzchoń, Piotr Borkowski, Dariusz Czerski, Marcin Sydow, Eryk Laskowski, Mieczysław A. Kłopotek

Abstract:Graph Spectral Clustering methods (GSC) allow representing clusters of diverse shapes, densities, etc. However, the results of such algorithms, when applied e.g. to text documents, are hard to explain to the user, especially due to embedding in the spectral space which has no obvious relation to document contents. Furthermore, the presence of documents without clear content meaning and the stochastic nature of the clustering algorithms deteriorate explainability. This paper proposes an enhancement to the explanation methodology, proposed in an earlier research of our team. It allows us to overcome the latter problems by taking inspiration from rough set theory.

* 24 figures, 21tables

Via

Access Paper or Ask Questions

A Method for Handling Negative Similarities in Explainable Graph Spectral Clustering of Text Documents -- Extended Version

Apr 16, 2025

Mieczysław A. Kłopotek, Sławomir T. Wierzchoń, Bartłomiej Starosta, Dariusz Czerski, Piotr Borkowski

Abstract:This paper investigates the problem of Graph Spectral Clustering with negative similarities, resulting from document embeddings different from the traditional Term Vector Space (like doc2vec, GloVe, etc.). Solutions for combinatorial Laplacians and normalized Laplacians are discussed. An experimental investigation shows the advantages and disadvantages of 6 different solutions proposed in the literature and in this research. The research demonstrates that GloVe embeddings frequently cause failures of normalized Laplacian based GSC due to negative similarities. Furthermore, application of methods curing similarity negativity leads to accuracy improvement for both combinatorial and normalized Laplacian based GSC. It also leads to applicability for GloVe embeddings of explanation methods developed originally bythe authors for Term Vector Space embeddings.

* 1 figure, 17 pages, this is an extended version of a paper accepted for the 25th International Conference on Computational Science (ICCS), 7-9 July 2025

Via

Access Paper or Ask Questions

Eigenvalue-based Incremental Spectral Clustering

Aug 18, 2023

Mieczysław A. Kłopotek, Bartłmiej Starosta, Sławomir T. Wierzchoń

Abstract:Our previous experiments demonstrated that subsets collections of (short) documents (with several hundred entries) share a common normalized in some way eigenvalue spectrum of combinatorial Laplacian. Based on this insight, we propose a method of incremental spectral clustering. The method consists of the following steps: (1) split the data into manageable subsets, (2) cluster each of the subsets, (3) merge clusters from different subsets based on the eigenvalue spectrum similarity to form clusters of the entire set. This method can be especially useful for clustering methods of complexity strongly increasing with the size of the data sample,like in case of typical spectral clustering. Experiments were performed showing that in fact the clustering and merging the subsets yields clusters close to clustering the entire dataset.

* 14 tables, 6 figures

Via

Access Paper or Ask Questions

Explainable Graph Spectral Clustering of Text Documents

Aug 01, 2023

Bartłomiej Starosta, Mieczysław A. Kłopotek, Sławomir T. Wierzchoń

Figure 1 for Explainable Graph Spectral Clustering of Text Documents

Figure 2 for Explainable Graph Spectral Clustering of Text Documents

Figure 3 for Explainable Graph Spectral Clustering of Text Documents

Figure 4 for Explainable Graph Spectral Clustering of Text Documents

Abstract:Spectral clustering methods are known for their ability to represent clusters of diverse shapes, densities etc. However, results of such algorithms, when applied e.g. to text documents, are hard to explain to the user, especially due to embedding in the spectral space which has no obvious relation to document contents. Therefore there is an urgent need to elaborate methods for explaining the outcome of the clustering. This paper presents a contribution towards this goal. We present a proposal of explanation of results of combinatorial Laplacian based graph spectral clustering. It is based on showing (approximate) equivalence of combinatorial Laplacian embedding, $K$-embedding (proposed in this paper) and term vector space embedding. Hence a bridge is constructed between the textual contents and the clustering results. We provide theoretical background for this approach. We performed experimental study showing that $K$-embedding approximates well Laplacian embedding under favourable block matrix conditions and show that approximation is good enough under other conditions.

* 4 figures, 15 tables

Via

Access Paper or Ask Questions

Query Optimization Properties of Modified VBS

Sep 26, 2019

Mieczysław A. Kłopotek, Sławomir T. Wierzchoń

Figure 1 for Query Optimization Properties of Modified VBS

Figure 2 for Query Optimization Properties of Modified VBS

Abstract:Valuation-Based~System can represent knowledge in different domains including probability theory, Dempster-Shafer theory and possibility theory. More recent studies show that the framework of VBS is also appropriate for representing and solving Bayesian decision problems and optimization problems. In this paper after introducing the valuation based system (VBS) framework, we present Markov-like properties of VBS and a method for resolving queries to VBS.

* 7 pages, 2 figures; published as: M.A. K{\l}opotek, S.T. Wierzcho\'n: Query optimization properties of modified Valuation-Based Systems. [in:] R. Trappl Ed.: Cybernetics and Systems . Proc. 13th European Meeting on Cybernetics and System Research, Vienna, 9-12 April 1996, Vol. I. Austrian Society for Cybernetic Studies, 1996, pp. 335-340

Via

Access Paper or Ask Questions

On Marginally Correct Approximations of Dempster-Shafer Belief Functions from Data

Dec 07, 2018

Mieczysław A. Kłopotek, Sławomir T. Wierzchoń

Abstract:Mathematical Theory of Evidence (MTE), a foundation for reasoning under partial ignorance, is blamed to leave frequencies outside (or aside of) its framework. The seriousness of this accusation is obvious: no experiment may be run to compare the performance of MTE-based models of real world processes against real world data. In this paper we consider this problem from the point of view of conditioning in the MTE. We describe the class of belief functions for which marginal consistency with observed frequencies may be achieved and conditional belief functions are proper belief functions,%\ and deal with implications for (marginal) approximation of general belief functions by this class of belief functions and for inference models in MTE.

* M.A. K{\l}opotek, S.T. Wierzcho\'n: On Marginally Correct Approximations of Dempster-Shafer Belief Functions from Data. Proc. IPMU'96 (Information Processing and Management of Uncertainty), Grenada (Spain), Publisher: Universitaed de Granada, 1-5 July 1996, Vol II, pp. 769-774

Via

Access Paper or Ask Questions

Basic Formal Properties of A Relational Model of The Mathematical Theory of Evidence

Apr 08, 2017

Mieczysław A. Kłopotek, Sławomir T. Wierzchoń

Figure 1 for Basic Formal Properties of A Relational Model of The Mathematical Theory of Evidence

Figure 2 for Basic Formal Properties of A Relational Model of The Mathematical Theory of Evidence

Figure 3 for Basic Formal Properties of A Relational Model of The Mathematical Theory of Evidence

Figure 4 for Basic Formal Properties of A Relational Model of The Mathematical Theory of Evidence

Abstract:The paper presents a novel view of the Dempster-Shafer belief function as a measure of diversity in relational data bases. It is demonstrated that under the interpretation The Dempster rule of evidence combination corresponds to the join operator of the relational database theory. This rough-set based interpretation is qualitative in nature and can represent a number of belief function operators. The interpretation has the property that Given a definition of the belief measure of objects in the interpretation domain we can perform operations in this domain and the measure of the resulting object is derivable from measures of component objects via belief operator. We demonstrated this property for Dempster rule of combination, marginalization, Shafer's conditioning, independent variables, Shenoy's notion of conditional independence of variables. The interpretation is based on rough sets (in connection with decision tables), but differs from previous interpretations of this type in that it counts the diversity rather than frequencies in a decision table.

* This is the preliminary version of the paper published in Demonstratio Mathematica. Vol XXXI No 3,1998, pp. 669-688
* 23 pages

Via

Access Paper or Ask Questions