Xiaojie Guo

Trash or Treasure? An Interactive Dual-Stream Strategy for Single Image Reflection Separation

Oct 20, 2021
Qiming Hu, Xiaojie Guo

Single image reflection separation (SIRS), as a representative blind source separation task, aims to recover two layers, i.e., transmission and reflection, from one mixed observation, which is challenging due to its highly ill-posed nature. Existing deep learning based solutions typically restore the target layers individually, or with some concerns at the end of the output, barely taking into account the interaction across the two streams/branches. In order to utilize information more efficiently, this work presents a general yet simple interactive strategy, namely "your trash is my treasure" (YTMT), for constructing dual-stream decomposition networks. To be specific, we explicitly enforce the two streams to communicate with each other block by block. Inspired by the additive property between the two components, the interactive path can be easily built by transferring, instead of discarding, the information deactivated by the ReLU rectifier from one stream to the other. Ablation studies and experiments on widely used SIRS datasets demonstrate the efficacy of YTMT and reveal its superiority over other state-of-the-art alternatives. The implementation is quite simple and our code is publicly available at https://github.com/mingcv/YTMT-Strategy.

* Accepted to NeurIPS2021
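
The exchange described in the abstract above, passing on what ReLU deactivates instead of discarding it, can be illustrated with a minimal sketch. It assumes the two streams are fused by element-wise addition and works on plain lists of floats. The function names (`relu`, `deactivated`, `ytmt_exchange`) are hypothetical and are not taken from the authors' repository, which should be consulted for the actual block design.

```python
def relu(x):
    """Positive part of each activation: max(v, 0)."""
    return [max(v, 0.0) for v in x]


def deactivated(x):
    """What ReLU would throw away: the negative part min(v, 0)."""
    return [min(v, 0.0) for v in x]


def ytmt_exchange(a, b):
    """One interaction step between two streams (assumed additive fusion).

    Each stream keeps its own ReLU-activated part and receives the
    part that the other stream's ReLU deactivated.
    """
    out_a = [p + n for p, n in zip(relu(a), deactivated(b))]
    out_b = [p + n for p, n in zip(relu(b), deactivated(a))]
    return out_a, out_b


if __name__ == "__main__":
    a = [1.0, -2.0, 3.0]
    b = [-0.5, 4.0, -1.0]
    out_a, out_b = ytmt_exchange(a, b)
    print(out_a, out_b)
```

Under this additive assumption no information is lost in the step: the element-wise sum of the two outputs equals the element-wise sum of the two inputs, which mirrors the additive property between the two layers that the abstract mentions.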

Graph Neural Networks for Natural Language Processing: A Survey

Jun 10, 2021
Lingfei Wu, Yu Chen, Kai Shen, Xiaojie Guo, Hanning Gao, Shucheng Li, Jian Pei, Bo Long

Deep learning has become the dominant approach in coping with various tasks in Natural Language Processing (NLP). Although text inputs are typically represented as a sequence of tokens, there is a rich variety of NLP problems that can be best expressed with a graph structure. As a result, there is a surge of interest in developing new deep learning techniques on graphs for a large number of NLP tasks. In this survey, we present a comprehensive overview of Graph Neural Networks (GNNs) for Natural Language Processing. We propose a new taxonomy of GNNs for NLP, which systematically organizes existing research on GNNs for NLP along three axes: graph construction, graph representation learning, and graph based encoder-decoder models. We further introduce a large number of NLP applications that exploit the power of GNNs and summarize the corresponding benchmark datasets, evaluation metrics, and open-source code. Finally, we discuss various outstanding challenges for making full use of GNNs for NLP as well as future research directions. To the best of our knowledge, this is the first comprehensive overview of Graph Neural Networks for Natural Language Processing.

* 127 pages

A Systematic Survey on Deep Generative Models for Graph Generation

Jul 18, 2020
Xiaojie Guo, Liang Zhao

Graphs are important data representations for describing objects and their relationships, which appear in a wide diversity of real-world scenarios. As one of the critical problems in this area, graph generation considers learning the distributions of given graphs and generating more novel graphs. Owing to their wide range of applications, generative models for graphs have a rich history; traditionally, however, they are hand-crafted and only capable of modeling a few statistical properties of graphs. Recent advances in deep generative models for graph generation are an important step towards improving the fidelity of generated graphs and pave the way for new kinds of applications. This article provides an extensive overview of the literature in the field of deep generative models for graph generation. Firstly, the formal definition of deep generative models for graph generation, as well as preliminary knowledge, is provided. Secondly, two taxonomies of deep generative models, for unconditional and conditional graph generation respectively, are proposed; the existing works of each are compared and analyzed. After that, an overview of the evaluation metrics in this specific domain is provided. Finally, the applications that deep graph generation enables are summarized and five promising future research directions are highlighted.

Interpretable Deep Graph Generation with Node-Edge Co-Disentanglement

Jun 09, 2020
Xiaojie Guo, Liang Zhao, Zhao Qin, Lingfei Wu, Amarda Shehu, Yanfang Ye

Disentangled representation learning has recently attracted a significant amount of attention, particularly in the field of image representation learning. However, learning the disentangled representations behind a graph remains largely unexplored, especially for attributed graphs with both node and edge features. Disentanglement learning for graph generation poses substantial new challenges, including 1) the lack of graph deconvolution operations to jointly decode node and edge attributes; and 2) the difficulty in enforcing the disentanglement among latent factors that respectively influence: i) only nodes, ii) only edges, and iii) joint patterns between them. To address these challenges, we propose a new disentanglement enhancement framework for deep generative models for attributed graphs. In particular, a novel variational objective is proposed to disentangle the above three types of latent factors, with a novel architecture for node and edge deconvolutions. Moreover, within each type, individual-factor-wise disentanglement is further enhanced, which is shown to be a generalization of the existing framework for images. Qualitative and quantitative experiments on both synthetic and real-world datasets demonstrate the effectiveness of the proposed model and its extensions.

* This paper has been accepted by KDD 2020

Generating Tertiary Protein Structures via an Interpretative Variational Autoencoder

Apr 08, 2020
Xiaojie Guo, Sivani Tadepalli, Liang Zhao, Amarda Shehu

Much scientific enquiry across disciplines is founded upon a mechanistic treatment of dynamic systems that ties form to function. A highly visible instance of this is in molecular biology, where an important goal is to determine functionally-relevant forms/structures that a protein molecule employs to interact with molecular partners in the living cell. This goal is typically pursued under the umbrella of stochastic optimization with algorithms that optimize a scoring function. Research repeatedly shows that current scoring functions, though steadily improving, correlate weakly with molecular activity. Inspired by recent momentum in generative deep learning, this paper proposes and evaluates an alternative approach to generating functionally-relevant three-dimensional structures of a protein. Though deep generative models typically struggle with highly-structured data, the work presented here circumvents this challenge via graph-generative models. A comprehensive evaluation of several deep architectures shows the promise of generative models in directly revealing the latent space for sampling novel tertiary structures, as well as in highlighting axes/factors that carry structural meaning and open the black box often associated with deep models. The work presented here is a first step towards interpretative, deep generative models becoming viable and informative complementary approaches to protein structure prediction.

Deep Multi-attributed Graph Translation with Node-Edge Co-evolution

Mar 22, 2020
Xiaojie Guo, Liang Zhao, Cameron Nowzari, Setareh Rafatirad, Houman Homayoun, Sai Manoj Pudukotai Dinakarrao

Generalized from image and language translation, graph translation aims to generate a graph in the target domain conditioned on an input graph in the source domain. This promising topic has attracted fast-increasing attention recently. Existing works are limited to either merely predicting the node attributes of graphs with fixed topology or predicting only the graph topology without considering node attributes, but cannot simultaneously predict both of them, due to substantial challenges: 1) difficulty in characterizing the interactive, iterative, and asynchronous translation process of both nodes and edges and 2) difficulty in discovering and maintaining the inherent consistency between the nodes and edges in predicted graphs. These challenges prevent a generic, end-to-end framework for joint node and edge attribute prediction, which is needed for real-world applications such as malware confinement in IoT networks and structural-to-functional network translation. These real-world applications depend heavily on hand-crafted and ad-hoc heuristic models, which cannot sufficiently utilize massive historical data. In this paper, we termed this generic problem "multi-attributed graph translation" and developed a novel framework integrating both node and edge translations seamlessly. The novel edge translation path is generic and is proven to be a generalization of the existing topology translation models. Then, a spectral graph regularization based on our non-parametric graph Laplacian is proposed in order to learn and maintain the consistency of the predicted nodes and edges. Finally, extensive experiments on both synthetic and real-world application data demonstrated the effectiveness of the proposed method.

* International Conference on Data Mining (ICDM), Beijing, China, 2019, pp. 250-259
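
As background for the spectral regularization mentioned in the abstract above, the following sketch shows the standard Laplacian smoothness penalty on a fixed, undirected graph. It uses the textbook combinatorial Laplacian L = D - A, not the paper's non-parametric Laplacian, and all function names are hypothetical. The penalty x^T L x is small when connected nodes carry similar values, which is one common way to tie node attributes to edge structure.

```python
def laplacian(adj):
    """Combinatorial Laplacian L = D - A of a symmetric adjacency matrix."""
    n = len(adj)
    return [
        [(sum(adj[i]) if i == j else 0.0) - adj[i][j] for j in range(n)]
        for i in range(n)
    ]


def smoothness(adj, x):
    """Quadratic form x^T L x for one scalar attribute per node."""
    lap = laplacian(adj)
    n = len(x)
    return sum(x[i] * lap[i][j] * x[j] for i in range(n) for j in range(n))


def pairwise_penalty(adj, x):
    """Equivalent form: 0.5 * sum_ij A_ij * (x_i - x_j)^2."""
    n = len(x)
    return 0.5 * sum(
        adj[i][j] * (x[i] - x[j]) ** 2 for i in range(n) for j in range(n)
    )


if __name__ == "__main__":
    # Path graph 0 - 1 - 2 with node attributes 1, 2, 4.
    adj = [[0.0, 1.0, 0.0], [1.0, 0.0, 1.0], [0.0, 1.0, 0.0]]
    x = [1.0, 2.0, 4.0]
    print(smoothness(adj, x), pairwise_penalty(adj, x))
```

Both functions compute the same quantity for a symmetric adjacency matrix with a zero diagonal; the pairwise form makes it visible that each edge contributes the squared difference of its endpoint values.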

LaFIn: Generative Landmark Guided Face Inpainting

Nov 26, 2019
Yang Yang, Xiaojie Guo, Jiayi Ma, Lin Ma, Haibin Ling

It is challenging to inpaint face images in the wild, due to the large variation of appearance, such as different poses, expressions and occlusions. A good inpainting algorithm should guarantee the realism of output, including the topological structure among eyes, nose and mouth, as well as the attribute consistency on pose, gender, ethnicity, expression, etc. This paper studies an effective deep learning based strategy to deal with these issues, which comprises a facial landmark predicting subnet and an image inpainting subnet. Concretely, given partial observation, the landmark predictor aims to provide the structural information (e.g. topological relationship and expression) of incomplete faces, while the inpaintor is to generate plausible appearance (e.g. gender and ethnicity) conditioned on the predicted landmarks. Experiments on the CelebA-HQ and CelebA datasets are conducted to reveal the efficacy of our design and to demonstrate its superiority over state-of-the-art alternatives both qualitatively and quantitatively. In addition, we assume that high-quality completed faces together with their landmarks can be utilized as augmented data to further improve the performance of (any) landmark predictor, which is corroborated by experimental results on the 300W and WFLW datasets.

EDIT: Exemplar-Domain Aware Image-to-Image Translation

Nov 24, 2019
Yuanbin Fu, Jiayi Ma, Lin Ma, Xiaojie Guo

Image-to-image translation converts an image of a certain style to another of the target style with the content preserved. A desired translator should be capable of generating diverse results in a controllable (many-to-many) fashion. To this end, we design a novel generative adversarial network, namely the exemplar-domain aware image-to-image translator (EDIT for short). The underlying principle is that, for images from multiple domains, the content features can be obtained by a uniform extractor, while (re-)stylization is achieved by mapping the extracted features specifically to different purposes (domains and exemplars). The generator of our EDIT comprises one set of blocks configured by shared parameters, and the rest configured by varied parameters exported by an exemplar-domain aware parameter network. In addition, a discriminator is used during the training phase to ensure that the output satisfies the distribution of the target domain. Our EDIT can flexibly and effectively work on multiple domains and arbitrary exemplars in a unified neat model. We conduct experiments to show the efficacy of our design, and reveal its advances over other state-of-the-art methods both quantitatively and qualitatively.

* 9 pages, 7 figures

Multi-stage Deep Classifier Cascades for Open World Recognition

Aug 26, 2019
Xiaojie Guo, Amir Alipour-Fanid, Lingfei Wu, Hemant Purohit, Xiang Chen, Kai Zeng, Liang Zhao

At present, object recognition studies are mostly conducted in a closed lab setting, where the classes in the test phase have typically appeared in the training phase. However, real-world problems are far more challenging because: i) new classes unseen in the training phase can appear when predicting; ii) discriminative features need to evolve when new classes emerge in real time; and iii) instances in new classes may not follow the "independent and identically distributed" (iid) assumption. Most existing work only aims to detect the unknown classes and is incapable of continuing to learn newer classes. Although a few methods consider both detecting and including new classes, all are based on predefined handcrafted features that cannot evolve and are out-of-date for characterizing emerging classes. Thus, to address the above challenges, we propose a novel generic end-to-end framework consisting of a dynamic cascade of classifiers that incrementally learn their dynamic and inherent features. The proposed method injects dynamic elements into the system by detecting instances from unknown classes, while at the same time incrementally updating the model to include the new classes. The resulting cascade tree grows by adding a new leaf node classifier once a new class is detected, and the discriminative features are updated via an end-to-end learning strategy. Experiments on two real-world datasets demonstrate that our proposed method outperforms existing state-of-the-art methods.

* This paper has been accepted by CIKM 2019
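
The growth rule described in the abstract above, where a new leaf classifier is added once an instance is rejected by every existing leaf, can be illustrated with a toy sketch. This is not the paper's method: each "classifier" here is just a stored centroid with a fixed distance threshold, there is no deep feature learning, and the class name `ToyCascade`, the `radius` parameter and the generated labels are all hypothetical.

```python
import math


class ToyCascade:
    """A chain of leaves; each leaf accepts points near its centroid."""

    def __init__(self, radius):
        self.radius = radius   # assumed acceptance threshold per leaf
        self.leaves = []       # list of (label, centroid) pairs

    def predict(self, x):
        """Return the first accepting leaf's label, or None if unknown."""
        for label, centroid in self.leaves:
            if math.dist(x, centroid) <= self.radius:
                return label
        return None

    def observe(self, x):
        """Classify x; if every leaf rejects it, grow a new leaf for it."""
        label = self.predict(x)
        if label is None:
            label = f"class_{len(self.leaves)}"
            self.leaves.append((label, list(x)))
        return label


if __name__ == "__main__":
    cascade = ToyCascade(radius=1.0)
    for point in ([0.0, 0.0], [0.5, 0.0], [5.0, 5.0]):
        print(point, cascade.observe(point))
```

The sketch only captures the control flow (detect unknown, then extend the model). In the paper the leaves are learned classifiers and the features are updated end to end, neither of which this toy attempts.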
