Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lazhar Labiod

More Discriminative Sentence Embeddings via Semantic Graph Smoothing

Feb 20, 2024

Chakib Fettal, Lazhar Labiod, Mohamed Nadif

Abstract:This paper explores an empirical approach to learn more discriminantive sentence representations in an unsupervised fashion. Leveraging semantic graph smoothing, we enhance sentence embeddings obtained from pretrained models to improve results for the text clustering and classification tasks. Our method, validated on eight benchmarks, demonstrates consistent improvements, showcasing the potential of semantic graph smoothing in improving sentence embeddings for the supervised and unsupervised document categorization tasks.

* Accepted in EACL 2024

Via

Access Paper or Ask Questions

Scalable Multi-view Clustering via Explicit Kernel Features Maps

Feb 07, 2024

Chakib Fettal, Lazhar Labiod, Mohamed Nadif

Abstract:A growing awareness of multi-view learning as an important component in data science and machine learning is a consequence of the increasing prevalence of multiple views in real-world applications, especially in the context of networks. In this paper we introduce a new scalability framework for multi-view subspace clustering. An efficient optimization strategy is proposed, leveraging kernel feature maps to reduce the computational burden while maintaining good clustering performance. The scalability of the algorithm means that it can be applied to large-scale datasets, including those with millions of data points, using a standard machine, in a few minutes. We conduct extensive experiments on real-world benchmark networks of various sizes in order to evaluate the performance of our algorithm against state-of-the-art multi-view subspace clustering methods and attributed-network multi-view approaches.

Via

Access Paper or Ask Questions

Graph Cuts with Arbitrary Size Constraints Through Optimal Transport

Feb 07, 2024

Chakib Fettal, Lazhar Labiod, Mohamed Nadif

Abstract:A common way of partitioning graphs is through minimum cuts. One drawback of classical minimum cut methods is that they tend to produce small groups, which is why more balanced variants such as normalized and ratio cuts have seen more success. However, we believe that with these variants, the balance constraints can be too restrictive for some applications like for clustering of imbalanced datasets, while not being restrictive enough for when searching for perfectly balanced partitions. Here, we propose a new graph cut algorithm for partitioning graphs under arbitrary size constraints. We formulate the graph cut problem as a regularized Gromov-Wasserstein problem. We then propose to solve it using accelerated proximal GD algorithm which has global convergence guarantees, results in sparse solutions and only incurs an additional ratio of $\mathcal{O}(\log(n))$ compared to the classical spectral clustering algorithm but was seen to be more efficient.

Via

Access Paper or Ask Questions

Spectral Clustering via Ensemble Deep Autoencoder Learning (SC-EDAE)

Jan 08, 2019

Severine Affeldt, Lazhar Labiod, Mohamed Nadif

Figure 1 for Spectral Clustering via Ensemble Deep Autoencoder Learning (SC-EDAE)

Figure 2 for Spectral Clustering via Ensemble Deep Autoencoder Learning (SC-EDAE)

Figure 3 for Spectral Clustering via Ensemble Deep Autoencoder Learning (SC-EDAE)

Figure 4 for Spectral Clustering via Ensemble Deep Autoencoder Learning (SC-EDAE)

Abstract:Recently, a number of works have studied clustering strategies that combine classical clustering algorithms and deep learning methods. These approaches follow either a sequential way, where a deep representation is learned using a deep autoencoder before obtaining clusters with k-means, or a simultaneous way, where deep representation and clusters are learned jointly by optimizing a single objective function. Both strategies improve clustering performance, however the robustness of these approaches is impeded by several deep autoencoder setting issues, among which the weights initialization, the width and number of layers or the number of epochs. To alleviate the impact of such hyperparameters setting on the clustering performance, we propose a new model which combines the spectral clustering and deep autoencoder strengths in an ensemble learning framework. Extensive experiments on various benchmark datasets demonstrate the potential and robustness of our approach compared to state-of-the art deep clustering methods.

* This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Via

Access Paper or Ask Questions