Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Qingquan Song

Geometric Graph Representation Learning via Maximizing Rate Reduction

Feb 13, 2022

Xiaotian Han, Zhimeng Jiang, Ninghao Liu, Qingquan Song, Jundong Li, Xia Hu

Figure 1 for Geometric Graph Representation Learning via Maximizing Rate Reduction

Figure 2 for Geometric Graph Representation Learning via Maximizing Rate Reduction

Figure 3 for Geometric Graph Representation Learning via Maximizing Rate Reduction

Figure 4 for Geometric Graph Representation Learning via Maximizing Rate Reduction

Abstract:Learning discriminative node representations benefits various downstream tasks in graph analysis such as community detection and node classification. Existing graph representation learning methods (e.g., based on random walk and contrastive learning) are limited to maximizing the local similarity of connected nodes. Such pair-wise learning schemes could fail to capture the global distribution of representations, since it has no explicit constraints on the global geometric properties of representation space. To this end, we propose Geometric Graph Representation Learning (G2R) to learn node representations in an unsupervised manner via maximizing rate reduction. In this way, G2R maps nodes in distinct groups (implicitly stored in the adjacency matrix) into different subspaces, while each subspace is compact and different subspaces are dispersedly distributed. G2R adopts a graph neural network as the encoder and maximizes the rate reduction with the adjacency matrix. Furthermore, we theoretically and empirically demonstrate that rate reduction maximization is equivalent to maximizing the principal angles between different subspaces. Experiments on real-world datasets show that G2R outperforms various baselines on node classification and community detection tasks.

* Accepted by TheWebConference(WWW) 2022

Via

Access Paper or Ask Questions

Towards Interaction Detection Using Topological Analysis on Neural Networks

Nov 04, 2020

Zirui Liu, Qingquan Song, Kaixiong Zhou, Ting Hsiang Wang, Ying Shan, Xia Hu

Figure 1 for Towards Interaction Detection Using Topological Analysis on Neural Networks

Figure 2 for Towards Interaction Detection Using Topological Analysis on Neural Networks

Figure 3 for Towards Interaction Detection Using Topological Analysis on Neural Networks

Figure 4 for Towards Interaction Detection Using Topological Analysis on Neural Networks

Abstract:Detecting statistical interactions between input features is a crucial and challenging task. Recent advances demonstrate that it is possible to extract learned interactions from trained neural networks. It has also been observed that, in neural networks, any interacting features must follow a strongly weighted connection to common hidden units. Motivated by the observation, in this paper, we propose to investigate the interaction detection problem from a novel topological perspective by analyzing the connectivity in neural networks. Specially, we propose a new measure for quantifying interaction strength, based upon the well-received theory of persistent homology. Based on this measure, a Persistence Interaction detection~(PID) algorithm is developed to efficiently detect interactions. Our proposed algorithm is evaluated across a number of interaction detection tasks on several synthetic and real world datasets with different hyperparameters. Experimental results validate that the PID algorithm outperforms the state-of-the-art baselines.

Via

Access Paper or Ask Questions

Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction

Jun 29, 2020

Qingquan Song, Dehua Cheng, Hanning Zhou, Jiyan Yang, Yuandong Tian, Xia Hu

Figure 1 for Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction

Figure 2 for Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction

Figure 3 for Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction

Figure 4 for Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction

Abstract:Click-Through Rate (CTR) prediction is one of the most important machine learning tasks in recommender systems, driving personalized experience for billions of consumers. Neural architecture search (NAS), as an emerging field, has demonstrated its capabilities in discovering powerful neural network architectures, which motivates us to explore its potential for CTR predictions. Due to 1) diverse unstructured feature interactions, 2) heterogeneous feature space, and 3) high data volume and intrinsic data randomness, it is challenging to construct, search, and compare different architectures effectively for recommendation models. To address these challenges, we propose an automated interaction architecture discovering framework for CTR prediction named AutoCTR. Via modularizing simple yet representative interactions as virtual building blocks and wiring them into a space of direct acyclic graphs, AutoCTR performs evolutionary architecture exploration with learning-to-rank guidance at the architecture level and achieves acceleration using low-fidelity model. Empirical analysis demonstrates the effectiveness of AutoCTR on different datasets comparing to human-crafted architectures. The discovered architecture also enjoys generalizability and transferability among different datasets.

Via

Access Paper or Ask Questions

AutoRec: An Automated Recommender System

Jun 26, 2020

Ting-Hsiang Wang, Qingquan Song, Xiaotian Han, Zirui Liu, Haifeng Jin, Xia Hu

Figure 1 for AutoRec: An Automated Recommender System

Figure 2 for AutoRec: An Automated Recommender System

Figure 3 for AutoRec: An Automated Recommender System

Abstract:Realistic recommender systems are often required to adapt to ever-changing data and tasks or to explore different models systematically. To address the need, we present AutoRec, an open-source automated machine learning (AutoML) platform extended from the TensorFlow ecosystem and, to our knowledge, the first framework to leverage AutoML for model search and hyperparameter tuning in deep recommendation models. AutoRec also supports a highly flexible pipeline that accommodates both sparse and dense inputs, rating prediction and click-through rate (CTR) prediction tasks, and an array of recommendation models. Lastly, AutoRec provides a simple, user-friendly API. Experiments conducted on the benchmark datasets reveal AutoRec is reliable and can identify models which resemble the best model without prior knowledge.

Via

Access Paper or Ask Questions

Multi-Channel Graph Convolutional Networks

Dec 17, 2019

Kaixiong Zhou, Qingquan Song, Xiao Huang, Daochen Zha, Na Zou, Xia Hu

Figure 1 for Multi-Channel Graph Convolutional Networks

Figure 2 for Multi-Channel Graph Convolutional Networks

Figure 3 for Multi-Channel Graph Convolutional Networks

Figure 4 for Multi-Channel Graph Convolutional Networks

Abstract:Graph neural networks (GNN) has been demonstrated to be effective in classifying graph structures. To further improve the graph representation learning ability, hierarchical GNN has been explored. It leverages the differentiable pooling to cluster nodes into fixed groups, and generates a coarse-grained structure accompanied with the shrinking of the original graph. However, such clustering would discard some graph information and achieve the suboptimal results. It is because the node inherently has different characteristics or roles, and two non-isomorphic graphs may have the same coarse-grained structure that cannot be distinguished after pooling. To compensate the loss caused by coarse-grained clustering and further advance GNN, we propose a multi-channel graph convolutional networks (MuchGCN). It is motivated by the convolutional neural networks, at which a series of channels are encoded to preserve the comprehensive characteristics of the input image. Thus, we define the specific graph convolutions to learn a series of graph channels at each layer, and pool graphs iteratively to encode the hierarchical structures. Experiments have been carefully carried out to demonstrate the superiority of MuchGCN over the state-of-the-art graph classification algorithms.

Via

Access Paper or Ask Questions

Sub-Architecture Ensemble Pruning in Neural Architecture Search

Oct 01, 2019

Yijun Bian, Qingquan Song, Mengnan Du, Jun Yao, Huanhuan Chen, Xia Hu

Figure 1 for Sub-Architecture Ensemble Pruning in Neural Architecture Search

Figure 2 for Sub-Architecture Ensemble Pruning in Neural Architecture Search

Figure 3 for Sub-Architecture Ensemble Pruning in Neural Architecture Search

Figure 4 for Sub-Architecture Ensemble Pruning in Neural Architecture Search

Abstract:Neural architecture search (NAS) is gaining more and more attention in recent years due to its flexibility and the remarkable capability of reducing the burden of neural network design. To achieve better performance, however, the searching process usually costs massive computation, which might not be affordable to researchers and practitioners. While recent attempts have employed ensemble learning methods to mitigate the enormous computation, an essential characteristic of diversity in ensemble methods is missed out, causing more similar sub-architectures to be gathered and potential redundancy in the final ensemble architecture. To bridge this gap, we propose a pruning method for NAS ensembles, named as ''Sub-Architecture Ensemble Pruning in Neural Architecture Search (SAEP).'' It targets to utilize diversity and achieve sub-ensemble architectures in a smaller size with comparable performance to the unpruned ensemble architectures. Three possible solutions are proposed to decide which subarchitectures should be pruned during the searching process. Experimental results demonstrate the effectiveness of the proposed method in largely reducing the size of ensemble architectures while maintaining the final performance. Moreover, distinct deeper architectures could be discovered if the searched sub-architectures are not diverse enough.

* This work was done when the first author was a visiting research scholar at Texas A&M University

Via

Access Paper or Ask Questions

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Sep 10, 2019

Kaixiong Zhou, Qingquan Song, Xiao Huang, Xia Hu

Figure 1 for Auto-GNN: Neural Architecture Search of Graph Neural Networks

Figure 2 for Auto-GNN: Neural Architecture Search of Graph Neural Networks

Figure 3 for Auto-GNN: Neural Architecture Search of Graph Neural Networks

Figure 4 for Auto-GNN: Neural Architecture Search of Graph Neural Networks

Abstract:Graph neural networks (GNN) has been successfully applied to operate on the graph-structured data. Given a specific scenario, rich human expertise and tremendous laborious trials are usually required to identify a suitable GNN architecture. It is because the performance of a GNN architecture is significantly affected by the choice of graph convolution components, such as aggregate function and hidden dimension. Neural architecture search (NAS) has shown its potential in discovering effective deep architectures for learning tasks in image and language modeling. However, existing NAS algorithms cannot be directly applied to the GNN search problem. First, the search space of GNN is different from the ones in existing NAS work. Second, the representation learning capacity of GNN architecture changes obviously with slight architecture modifications. It affects the search efficiency of traditional search methods. Third, widely used techniques in NAS such as parameter sharing might become unstable in GNN. To bridge the gap, we propose the automated graph neural networks (AGNN) framework, which aims to find an optimal GNN architecture within a predefined search space. A reinforcement learning based controller is designed to greedily validate architectures via small steps. AGNN has a novel parameter sharing strategy that enables homogeneous architectures to share parameters, based on a carefully-designed homogeneity definition. Experiments on real-world benchmark datasets demonstrate that the GNN architecture identified by AGNN achieves the best performance, comparing with existing handcrafted models and tradistional search methods.

Via

Access Paper or Ask Questions

Techniques for Automated Machine Learning

Jul 21, 2019

Yi-Wei Chen, Qingquan Song, Xia Hu

Figure 1 for Techniques for Automated Machine Learning

Figure 2 for Techniques for Automated Machine Learning

Figure 3 for Techniques for Automated Machine Learning

Figure 4 for Techniques for Automated Machine Learning

Abstract:Automated machine learning (AutoML) aims to find optimal machine learning solutions automatically given a machine learning problem. It could release the burden of data scientists from the multifarious manual tuning process and enable the access of domain experts to the off-the-shelf machine learning solutions without extensive experience. In this paper, we review the current developments of AutoML in terms of three categories, automated feature engineering (AutoFE), automated model and hyperparameter learning (AutoMHL), and automated deep learning (AutoDL). State-of-the-art techniques adopted in the three categories are presented, including Bayesian optimization, reinforcement learning, evolutionary algorithm, and gradient-based approaches. We summarize popular AutoML frameworks and conclude with current open challenges of AutoML.

Via

Access Paper or Ask Questions

Coupled Variational Recurrent Collaborative Filtering

Jun 11, 2019

Qingquan Song, Shiyu Chang, Xia Hu

Figure 1 for Coupled Variational Recurrent Collaborative Filtering

Figure 2 for Coupled Variational Recurrent Collaborative Filtering

Figure 3 for Coupled Variational Recurrent Collaborative Filtering

Figure 4 for Coupled Variational Recurrent Collaborative Filtering

Abstract:We focus on the problem of streaming recommender system and explore novel collaborative filtering algorithms to handle the data dynamicity and complexity in a streaming manner. Although deep neural networks have demonstrated the effectiveness of recommendation tasks, it is lack of explorations on integrating probabilistic models and deep architectures under streaming recommendation settings. Conjoining the complementary advantages of probabilistic models and deep neural networks could enhance both model effectiveness and the understanding of inference uncertainties. To bridge the gap, in this paper, we propose a Coupled Variational Recurrent Collaborative Filtering (CVRCF) framework based on the idea of Deep Bayesian Learning to handle the streaming recommendation problem. The framework jointly combines stochastic processes and deep factorization models under a Bayesian paradigm to model the generation and evolution of users' preferences and items' popularities. To ensure efficient optimization and streaming update, we further propose a sequential variational inference algorithm based on a cross variational recurrent neural network structure. Experimental results on three benchmark datasets demonstrate that the proposed framework performs favorably against the state-of-the-art methods in terms of both temporal dependency modeling and predictive accuracy. The learned latent variables also provide visualized interpretations for the evolution of temporal dynamics.

Via

Access Paper or Ask Questions

Multi-Label Adversarial Perturbations

Jan 02, 2019

Qingquan Song, Haifeng Jin, Xiao Huang, Xia Hu

Figure 1 for Multi-Label Adversarial Perturbations

Figure 2 for Multi-Label Adversarial Perturbations

Figure 3 for Multi-Label Adversarial Perturbations

Figure 4 for Multi-Label Adversarial Perturbations

Abstract:Adversarial examples are delicately perturbed inputs, which aim to mislead machine learning models towards incorrect outputs. While most of the existing work focuses on generating adversarial perturbations in multi-class classification problems, many real-world applications fall into the multi-label setting in which one instance could be associated with more than one label. For example, a spammer may generate adversarial spams with malicious advertising while maintaining the other labels such as topic labels unchanged. To analyze the vulnerability and robustness of multi-label learning models, we investigate the generation of multi-label adversarial perturbations. This is a challenging task due to the uncertain number of positive labels associated with one instance, as well as the fact that multiple labels are usually not mutually exclusive with each other. To bridge this gap, in this paper, we propose a general attacking framework targeting on multi-label classification problem and conduct a premier analysis on the perturbations for deep neural networks. Leveraging the ranking relationships among labels, we further design a ranking-based framework to attack multi-label ranking algorithms. We specify the connection between the two proposed frameworks and separately design two specific methods grounded on each of them to generate targeted multi-label perturbations. Experiments on real-world multi-label image classification and ranking problems demonstrate the effectiveness of our proposed frameworks and provide insights of the vulnerability of multi-label deep learning models under diverse targeted attacking strategies. Several interesting findings including an unpolished defensive strategy, which could potentially enhance the interpretability and robustness of multi-label deep learning models, are further presented and discussed at the end.

Via

Access Paper or Ask Questions