Michalis Vazirgiannis

Ecole Polytechnique, AUEB

New Frontiers in Graph Autoencoders: Joint Community Detection and Link Prediction

Nov 16, 2022

Guillaume Salha-Galvan, Johannes F. Lutzeyer, George Dasoulas, Romain Hennequin, Michalis Vazirgiannis

Abstract: Graph autoencoders (GAE) and variational graph autoencoders (VGAE) have emerged as powerful methods for link prediction (LP). Their performance is less impressive on community detection (CD), where they are often outperformed by simpler alternatives such as the Louvain method. It is still unclear to what extent one can improve CD with GAE and VGAE, especially in the absence of node features. It is moreover uncertain whether one could do so while simultaneously preserving good performance on LP in a multi-task setting. In this workshop paper, summarizing results from our journal publication (Salha-Galvan et al. 2022), we show that jointly addressing these two tasks with high accuracy is possible. For this purpose, we introduce a community-preserving message passing scheme, doping our GAE and VGAE encoders by considering both the initial graph and Louvain-based prior communities when computing embedding spaces. Inspired by modularity-based clustering, we further propose novel training and optimization strategies specifically designed for joint LP and CD. We demonstrate the empirical effectiveness of our approach, referred to as Modularity-Aware GAE and VGAE, on various real-world graphs.

* This NeurIPS 2022 GLFrontiers workshop paper summarizes results from the following journal article: arXiv:2202.00961. arXiv admin note: text overlap with arXiv:2205.14651
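
For readers unfamiliar with the quantity that the Louvain method and modularity-based clustering optimize, the following is a minimal sketch of Newman's modularity for an undirected, unweighted graph. It is an illustration only, not the authors' training objective; the function name `modularity` and the toy graph are assumptions made for this example.

```python
from collections import defaultdict


def modularity(edges, community):
    """Newman modularity Q of a partition of an undirected, unweighted graph.

    edges:     list of (u, v) pairs, each undirected edge listed once
    community: dict mapping every node to its community label
    """
    m = len(edges)
    if m == 0:
        raise ValueError("graph has no edges")
    intra = defaultdict(int)       # number of edges inside each community
    degree_sum = defaultdict(int)  # sum of node degrees per community
    for u, v in edges:
        degree_sum[community[u]] += 1
        degree_sum[community[v]] += 1
        if community[u] == community[v]:
            intra[community[u]] += 1
    # Q = sum_c [ L_c / m - (d_c / 2m)^2 ]
    return sum(intra[c] / m - (degree_sum[c] / (2 * m)) ** 2 for c in degree_sum)


# Toy graph (hypothetical): two triangles joined by a single bridge edge.
toy_edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
two_groups = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
one_group = {node: "a" for node in range(6)}
```

On this toy graph, splitting along the bridge gives Q = 5/14, while putting every node in one community gives Q = 0, which is why a modularity-maximizing method prefers the two-triangle partition.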

Improving Graph Neural Networks at Scale: Combining Approximate PageRank and CoreRank

Nov 08, 2022

Ariel R. Ramos Vela, Johannes F. Lutzeyer, Anastasios Giovanidis, Michalis Vazirgiannis

Abstract: Graph Neural Networks (GNNs) have achieved great success in many learning tasks performed on graph structures. Nonetheless, to propagate information, GNNs rely on a message passing scheme which can become prohibitively expensive when working with industrial-scale graphs. Inspired by the PPRGo model, we propose the CorePPR model, a scalable solution that utilises a learnable convex combination of the approximate personalised PageRank and the CoreRank to diffuse multi-hop neighbourhood information in GNNs. Additionally, we incorporate a dynamic mechanism to select the most influential neighbours for a particular node, which reduces training time while preserving the performance of the model. Overall, we demonstrate that CorePPR outperforms PPRGo, especially on large graphs where selecting the most influential nodes is particularly relevant for scalability. Our code is publicly available at: https://github.com/arielramos97/CorePPR.

* Accepted at the "NeurIPS 2022 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2022)"
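
The idea of a learnable convex combination of two node-importance scores, followed by neighbour selection, can be sketched as below. This is not the CorePPR implementation (see the authors' repository for that): the function name `combine_and_select`, the sigmoid parameterisation of the mixing weight, the per-score normalisation, and the fixed top-k cut-off are all assumptions made for illustration, and the two score dictionaries are taken as already computed.

```python
import math


def combine_and_select(ppr_scores, corerank_scores, theta, k):
    """Mix two score dictionaries convexly and keep the k highest-scoring nodes.

    theta is an unconstrained (learnable) parameter; passing it through a
    sigmoid keeps the mixing weight in (0, 1). This parameterisation is an
    assumption, not taken from the paper.
    """
    weight = 1.0 / (1.0 + math.exp(-theta))

    def normalise(scores):
        total = sum(scores.values())
        return {n: s / total for n, s in scores.items()} if total > 0 else dict(scores)

    ppr = normalise(ppr_scores)
    core = normalise(corerank_scores)
    combined = {
        n: weight * ppr.get(n, 0.0) + (1.0 - weight) * core.get(n, 0.0)
        for n in set(ppr) | set(core)
    }
    # Sort by descending score, breaking ties by node name for determinism.
    return sorted(combined.items(), key=lambda item: (-item[1], item[0]))[:k]


# Hypothetical scores of three candidate neighbours of one target node.
example_ppr = {"a": 0.6, "b": 0.3, "c": 0.1}
example_core = {"a": 1.0, "b": 1.0, "c": 6.0}
top_two = combine_and_select(example_ppr, example_core, theta=0.0, k=2)
```

With `theta=0.0` both scores are weighted equally, so node "c" (low PageRank but high CoreRank) ranks first, ahead of "a". In a trained model `theta` would be updated by gradient descent together with the other parameters.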

Weisfeiler and Leman go Hyperbolic: Learning Distance Preserving Node Representations

Nov 04, 2022

Giannis Nikolentzos, Michail Chatzianastasis, Michalis Vazirgiannis

Abstract: In recent years, graph neural networks (GNNs) have emerged as a promising tool for solving machine learning problems on graphs. Most GNNs are members of the family of message passing neural networks (MPNNs). There is a close connection between these models and the Weisfeiler-Leman (WL) test of isomorphism, an algorithm that can successfully test isomorphism for a broad class of graphs. Recently, much research has focused on measuring the expressive power of GNNs. For instance, it has been shown that standard MPNNs are at most as powerful as WL in terms of distinguishing non-isomorphic graphs. However, these studies have largely ignored the distances between the representations of nodes/graphs which are of paramount importance for learning tasks. In this paper, we define a distance function between nodes which is based on the hierarchy produced by the WL algorithm, and propose a model that learns representations which preserve those distances between nodes. Since the emerging hierarchy corresponds to a tree, to learn these representations, we capitalize on recent advances in the field of hyperbolic neural networks. We empirically evaluate the proposed model on standard node and graph classification datasets where it achieves competitive performance with state-of-the-art models.
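
To make the notion of a WL hierarchy concrete, here is a minimal sketch of WL colour refinement together with one possible tree distance between nodes. The abstract does not give the paper's exact distance, so `wl_tree_distance` (path length in the refinement tree between the final colour classes of two nodes) is an assumed stand-in, and both function names are hypothetical.

```python
def wl_colours(adjacency, iterations):
    """Run WL colour refinement and return the colouring after every iteration.

    adjacency: dict mapping each node to a list of its neighbours
    Returns a list of dicts; entry t maps node -> colour at iteration t.
    """
    colours = {v: 0 for v in adjacency}  # iteration 0: all nodes share one colour
    history = [dict(colours)]
    for _ in range(iterations):
        # A node's signature is its own colour plus the multiset of neighbour
        # colours, so every new colour class refines an old one (a tree).
        signatures = {
            v: (colours[v], tuple(sorted(colours[u] for u in adjacency[v])))
            for v in adjacency
        }
        palette = {sig: i for i, sig in enumerate(sorted(set(signatures.values())))}
        colours = {v: palette[signatures[v]] for v in adjacency}
        history.append(dict(colours))
    return history


def wl_tree_distance(history, u, v):
    """Assumed distance: tree path length between the final colour classes."""
    depth = len(history) - 1
    shared = 0  # deepest iteration at which u and v still share a colour
    for t, colours in enumerate(history):
        if colours[u] != colours[v]:
            break
        shared = t
    return 2 * (depth - shared)


# Hypothetical example: a path graph 0 - 1 - 2 - 3 - 4, refined for 2 iterations.
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
path_history = wl_colours(path, iterations=2)
```

On the path graph, the two end nodes are never separated (distance 0), nodes 1 and 2 are separated only at the second iteration (distance 2), and an end node and the centre are separated at the first iteration (distance 4).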

Questioning the Validity of Summarization Datasets and Improving Their Factual Consistency

Oct 31, 2022

Yanzhu Guo, Chloé Clavel, Moussa Kamal Eddine, Michalis Vazirgiannis

Abstract: The topic of summarization evaluation has recently attracted a surge of attention due to the rapid development of abstractive summarization systems. However, the formulation of the task is rather ambiguous: neither the linguistics nor the natural language processing community has succeeded in giving a mutually agreed-upon definition. Due to this lack of a well-defined formulation, a large number of popular abstractive summarization datasets are constructed in a manner that neither guarantees validity nor meets one of the most essential criteria of summarization: factual consistency. In this paper, we address this issue by combining state-of-the-art factual consistency models to identify the problematic instances present in popular summarization datasets. We release SummFC, a filtered summarization dataset with improved factual consistency, and demonstrate that models trained on this dataset achieve improved performance in nearly all quality aspects. We argue that our dataset should become a valid benchmark for developing and evaluating summarization systems.

* Accepted to EMNLP 2022

DATScore: Evaluating Translation with Data Augmented Translations

Oct 12, 2022

Moussa Kamal Eddine, Guokan Shang, Michalis Vazirgiannis

Abstract: The rapid development of large pretrained language models has revolutionized not only the field of Natural Language Generation (NLG) but also its evaluation. Inspired by BARTScore, a recent metric that leverages the BART language model to evaluate the quality of generated text from various aspects, we introduce DATScore. DATScore uses data augmentation techniques to improve the evaluation of machine translation. Our main finding is that introducing data augmented translations of the source and reference texts is very helpful in evaluating the quality of the generated translation. We also propose two novel score averaging and term weighting strategies to improve the original score computing process of BARTScore. Experimental results on WMT show that DATScore correlates better with human meta-evaluations than the other recent state-of-the-art metrics, especially for low-resource languages. Ablation studies demonstrate the value added by our new scoring strategies. Moreover, we report in our extended experiments the performance of DATScore on 3 NLG tasks other than translation.

Word Sense Induction with Hierarchical Clustering and Mutual Information Maximization

Oct 11, 2022

Hadi Abdine, Moussa Kamal Eddine, Michalis Vazirgiannis, Davide Buscaldi

Abstract: Word sense induction (WSI) is a difficult problem in natural language processing that involves the unsupervised automatic detection of a word's senses (i.e. meanings). Recent work achieves significant results on the WSI task by pre-training a language model that can exclusively disambiguate word senses, whereas others employ previously pre-trained language models in conjunction with additional strategies to induce senses. In this paper, we propose a novel unsupervised method based on hierarchical clustering and invariant information clustering (IIC). The IIC is used to train a small model to optimize the mutual information between two vector representations of a target word occurring in a pair of synthetic paraphrases. This model is later used in inference mode to extract a higher quality vector representation to be used in the hierarchical clustering. We evaluate our method on two WSI tasks and in two distinct clustering configurations (fixed and dynamic number of clusters). We empirically demonstrate that, in certain cases, our approach outperforms prior WSI state-of-the-art methods, while in others, it achieves a competitive performance.
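
The mutual information that IIC maximizes between two paired representations can be sketched as follows. This follows the standard IIC formulation (soft cluster assignments, symmetrised joint distribution); whether the paper uses exactly this form is an assumption, and the function name `iic_mutual_information` is hypothetical.

```python
import math


def iic_mutual_information(z, z_prime, eps=1e-12):
    """Mutual information between paired soft cluster assignments.

    z, z_prime: lists of n probability vectors of length C; row i of each is
    the assignment of one view of the same item (here, the target word in
    each of two paraphrases).
    """
    n, c = len(z), len(z[0])
    # Joint distribution over cluster pairs, averaged over the n pairs.
    joint = [[0.0] * c for _ in range(c)]
    for a, b in zip(z, z_prime):
        for i in range(c):
            for j in range(c):
                joint[i][j] += a[i] * b[j] / n
    # Symmetrise, since the order of the two views is arbitrary.
    joint = [[(joint[i][j] + joint[j][i]) / 2 for j in range(c)] for i in range(c)]
    row = [sum(joint[i]) for i in range(c)]
    col = [sum(joint[i][j] for i in range(c)) for j in range(c)]
    mi = 0.0
    for i in range(c):
        for j in range(c):
            p = joint[i][j]
            if p > eps:  # treat 0 * log(0) as 0
                mi += p * math.log(p / (row[i] * col[j]))
    return mi


# Hypothetical inputs: two items, two clusters.
agreeing = [[1.0, 0.0], [0.0, 1.0]]       # confident and consistent views
uninformative = [[0.5, 0.5], [0.5, 0.5]]  # every item spread over both clusters
```

When both views agree and the clusters are balanced, the value reaches its maximum of log C (here log 2); when the assignments carry no information, it is 0. A training loop would maximize this quantity, that is, minimize its negative.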

Abstractive Meeting Summarization: A Survey

Aug 08, 2022

Virgile Rennard, Guokan Shang, Julie Hunter, Michalis Vazirgiannis

Abstract: Recent advances in deep learning, and especially the invention of encoder-decoder architectures, have significantly improved the performance of abstractive summarization systems. While the majority of research has focused on written documents, we have observed an increasing interest in the summarization of dialogues and multi-party conversations over the past few years. A system that could reliably transform the audio or transcript of a human conversation into an abridged version that homes in on the most important points of the discussion would be valuable in a wide variety of real-world contexts, from business meetings to medical consultations to customer service calls. This paper focuses on abstractive summarization for multi-party meetings, providing a survey of the challenges, datasets and systems relevant to this task and a discussion of promising directions for future study.

Time Series Forecasting Models Copy the Past: How to Mitigate

Jul 27, 2022

Chrysoula Kosma, Giannis Nikolentzos, Nancy Xu, Michalis Vazirgiannis

Abstract: Time series forecasting is at the core of important application domains, posing significant challenges to machine learning algorithms. Recently, neural network architectures have been widely applied to the problem of time series forecasting. Most of these models are trained by minimizing a loss function that measures the deviation of predictions from the real values. Typical loss functions include mean squared error (MSE) and mean absolute error (MAE). In the presence of noise and uncertainty, neural network models tend to replicate the last observed value of the time series, thus limiting their applicability to real-world data. In this paper, we provide a formal definition of the above problem and give some examples of forecasts where the problem is observed. We also propose a regularization term penalizing the replication of previously seen values. We evaluate the proposed regularization term on both synthetic and real-world datasets. Our results indicate that the regularization term mitigates the aforementioned problem to some extent and gives rise to more robust models.

* Accepted at ICANN'22
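
The general shape of such a training objective, MSE plus a term that penalizes predictions sitting on the last observed value, can be sketched as below. The abstract does not state the paper's regularizer, so the Gaussian-shaped penalty, the function name `loss_with_copy_penalty`, and the hyperparameters `strength` and `width` are all assumptions chosen only to illustrate the idea.

```python
import math


def loss_with_copy_penalty(predictions, targets, last_observed, strength=0.5, width=1.0):
    """MSE plus an assumed penalty for replicating the last observed value.

    predictions, targets, last_observed: equal-length lists of floats, where
    last_observed[i] is the final input value of the window behind forecast i.
    The penalty equals 1 when a prediction copies the last value exactly and
    decays towards 0 as the prediction moves away from it.
    """
    n = len(predictions)
    mse = sum((p - t) ** 2 for p, t in zip(predictions, targets)) / n
    penalty = sum(
        math.exp(-((p - last) ** 2) / width)
        for p, last in zip(predictions, last_observed)
    ) / n
    return mse + strength * penalty


# Hypothetical single-step example: the series moved from 1.0 to 2.0.
copy_forecast = loss_with_copy_penalty([1.0], [2.0], [1.0])   # repeats the past
other_forecast = loss_with_copy_penalty([3.0], [2.0], [1.0])  # same squared error
```

Both forecasts have the same squared error, but the one that merely repeats the last observed value receives the larger loss, so gradient descent is nudged away from the copying behaviour. A penalty of this form also fires when the true value happens to equal the last observation, which is one reason the weight `strength` would need tuning.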

Image Keypoint Matching using Graph Neural Networks

May 27, 2022

Nancy Xu, Giannis Nikolentzos, Michalis Vazirgiannis, Henrik Boström

Abstract: Image matching is a key component of many tasks in computer vision and its main objective is to find correspondences between features extracted from different natural images. When images are represented as graphs, image matching boils down to the problem of graph matching which has been studied intensively in the past. In recent years, graph neural networks have shown great potential in the graph matching task, and have also been applied to image matching. In this paper, we propose a graph neural network for the problem of image matching. The proposed method first generates initial soft correspondences between keypoints using localized node embeddings and then iteratively refines the initial correspondences using a series of graph neural network layers. We evaluate our method on natural image datasets with keypoint annotations and show that, in comparison to a state-of-the-art model, our method speeds up inference times without sacrificing prediction accuracy.

* Complex Networks
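
One common way to obtain initial soft correspondences from node embeddings is a row-wise softmax over pairwise similarities, sketched below. The abstract does not specify how the paper computes them, so the dot-product similarity, the `temperature` parameter, and the function name `soft_correspondences` are assumptions; the refinement by graph neural network layers is not shown.

```python
import math


def soft_correspondences(source, target, temperature=1.0):
    """Row-stochastic matching matrix between two sets of keypoint embeddings.

    source, target: lists of equal-length embedding vectors (lists of floats).
    Entry [i][j] is the soft probability that source keypoint i matches
    target keypoint j.
    """
    matrix = []
    for s in source:
        scores = [sum(a * b for a, b in zip(s, t)) / temperature for t in target]
        peak = max(scores)  # subtract the maximum for numerical stability
        exps = [math.exp(x - peak) for x in scores]
        total = sum(exps)
        matrix.append([e / total for e in exps])
    return matrix


# Hypothetical embeddings: source keypoint 0 resembles target keypoint 1
# and source keypoint 1 resembles target keypoint 0.
source_points = [[1.0, 0.0], [0.0, 1.0]]
target_points = [[0.0, 2.0], [2.0, 0.0]]
matching = soft_correspondences(source_points, target_points)
```

Each row of `matching` sums to 1, and taking the arg-max of a row yields a hard match. Lowering `temperature` sharpens the distribution towards a one-to-one assignment.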

Political Communities on Twitter: Case Study of the 2022 French Presidential Election

Apr 15, 2022

Hadi Abdine, Yanzhu Guo, Virgile Rennard, Michalis Vazirgiannis

Abstract: With the significant increase in users on social media platforms, a new means of political campaigning has appeared. Twitter and Facebook are now notable campaigning tools during elections. Indeed, the candidates and their parties now take to the internet to interact and spread their ideas. In this paper, we aim to identify political communities formed on Twitter during the 2022 French presidential election and analyze each respective community. We create a large-scale Twitter dataset containing 1.2 million users and 62.6 million tweets that mention keywords relevant to the election. We perform community detection on a retweet graph of users and propose an in-depth analysis of the stance of each community. Finally, we attempt to detect offensive tweets and automatic bots, comparing across communities in order to gain insight into each candidate's supporter demographics and online campaign strategy.
