Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mengdi Zhang

Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

Aug 16, 2022
Xiao Liu, Shiyu Zhao, Kai Su, Yukuo Cen, Jiezhong Qiu, Mengdi Zhang, Wei Wu, Yuxiao Dong, Jie Tang

Figure 1 for Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

Figure 2 for Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

Figure 3 for Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

Figure 4 for Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

Knowledge graph (KG) embeddings have been a mainstream approach for reasoning over incomplete KGs. However, limited by their inherently shallow and static architectures, they can hardly deal with the rising focus on complex logical queries, which comprise logical operators, imputed edges, multiple source entities, and unknown intermediate entities. In this work, we present the Knowledge Graph Transformer (kgTransformer) with masked pre-training and fine-tuning strategies. We design a KG triple transformation method to enable Transformer to handle KGs, which is further strengthened by the Mixture-of-Experts (MoE) sparse activation. We then formulate the complex logical queries as masked prediction and introduce a two-stage masked pre-training strategy to improve transferability and generalizability. Extensive experiments on two benchmarks demonstrate that kgTransformer can consistently outperform both KG embedding-based baselines and advanced encoders on nine in-domain and out-of-domain reasoning tasks. Additionally, kgTransformer can reason with explainability via providing the full reasoning paths to interpret given answers.

* kgTransformer; Accepted to KDD 2022

Via

Access Paper or Ask Questions

Long Short-Term Preference Modeling for Continuous-Time Sequential Recommendation

Aug 01, 2022
Huixuan Chi, Hao Xu, Hao Fu, Mengya Liu, Mengdi Zhang, Yuji Yang, Qinfen Hao, Wei Wu

Figure 1 for Long Short-Term Preference Modeling for Continuous-Time Sequential Recommendation

Figure 2 for Long Short-Term Preference Modeling for Continuous-Time Sequential Recommendation

Figure 3 for Long Short-Term Preference Modeling for Continuous-Time Sequential Recommendation

Figure 4 for Long Short-Term Preference Modeling for Continuous-Time Sequential Recommendation

Modeling the evolution of user preference is essential in recommender systems. Recently, dynamic graph-based methods have been studied and achieved SOTA for recommendation, majority of which focus on user's stable long-term preference. However, in real-world scenario, user's short-term preference evolves over time dynamically. Although there exists sequential methods that attempt to capture it, how to model the evolution of short-term preference with dynamic graph-based methods has not been well-addressed yet. In particular: 1) existing methods do not explicitly encode and capture the evolution of short-term preference as sequential methods do; 2) simply using last few interactions is not enough for modeling the changing trend. In this paper, we propose Long Short-Term Preference Modeling for Continuous-Time Sequential Recommendation (LSTSR) to capture the evolution of short-term preference under dynamic graph. Specifically, we explicitly encode short-term preference and optimize it via memory mechanism, which has three key operations: Message, Aggregate and Update. Our memory mechanism can not only store one-hop information, but also trigger with new interactions online. Extensive experiments conducted on five public datasets show that LSTSR consistently outperforms many state-of-the-art recommendation methods across various lines.

* 9 pages, 4 figures

Via

Access Paper or Ask Questions

Ensemble Multi-Relational Graph Neural Networks

May 24, 2022
Yuling Wang, Hao Xu, Yanhua Yu, Mengdi Zhang, Zhenhao Li, Yuji Yang, Wei Wu

Figure 1 for Ensemble Multi-Relational Graph Neural Networks

Figure 2 for Ensemble Multi-Relational Graph Neural Networks

Figure 3 for Ensemble Multi-Relational Graph Neural Networks

Figure 4 for Ensemble Multi-Relational Graph Neural Networks

It is well established that graph neural networks (GNNs) can be interpreted and designed from the perspective of optimization objective. With this clear optimization objective, the deduced GNNs architecture has sound theoretical foundation, which is able to flexibly remedy the weakness of GNNs. However, this optimization objective is only proved for GNNs with single-relational graph. Can we infer a new type of GNNs for multi-relational graphs by extending this optimization objective, so as to simultaneously solve the issues in previous multi-relational GNNs, e.g., over-parameterization? In this paper, we propose a novel ensemble multi-relational GNNs by designing an ensemble multi-relational (EMR) optimization objective. This EMR optimization objective is able to derive an iterative updating rule, which can be formalized as an ensemble message passing (EnMP) layer with multi-relations. We further analyze the nice properties of EnMP layer, e.g., the relationship with multi-relational personalized PageRank. Finally, a new multi-relational GNNs which well alleviate the over-smoothing and over-parameterization issues are proposed. Extensive experiments conducted on four benchmark datasets well demonstrate the effectiveness of the proposed model.

* 7 pages, 3 figures, published to IJCAI 2022

Via

Access Paper or Ask Questions

Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification

May 18, 2022
Kai Zhang, Qi Liu, Zhenya Huang, Mingyue Cheng, Kun Zhang, Mengdi Zhang, Wei Wu, Enhong Chen

Figure 1 for Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification

Figure 2 for Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification

Figure 3 for Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification

Figure 4 for Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification

Cross-domain sentiment classification (CDSC) aims to use the transferable semantics learned from the source domain to predict the sentiment of reviews in the unlabeled target domain. Existing studies in this task attach more attention to the sequence modeling of sentences while largely ignoring the rich domain-invariant semantics embedded in graph structures (i.e., the part-of-speech tags and dependency relations). As an important aspect of exploring characteristics of language comprehension, adaptive graph representations have played an essential role in recent years. To this end, in the paper, we aim to explore the possibility of learning invariant semantic features from graph-like structures in CDSC. Specifically, we present Graph Adaptive Semantic Transfer (GAST) model, an adaptive syntactic graph embedding method that is able to learn domain-invariant semantics from both word sequences and syntactic graphs. More specifically, we first raise a POS-Transformer module to extract sequential semantic features from the word sequences as well as the part-of-speech tags. Then, we design a Hybrid Graph Attention (HGAT) module to generate syntax-based semantic features by considering the transferable dependency relations. Finally, we devise an Integrated aDaptive Strategy (IDS) to guide the joint learning process of both modules. Extensive experiments on four public datasets indicate that GAST achieves comparable effectiveness to a range of state-of-the-art models.

* 11 pages, 7 figures, Sigir 2022

Via

Access Paper or Ask Questions

Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment Analysis

Mar 30, 2022
Kai Zhang, Kun Zhang, Mengdi Zhang, Hongke Zhao, Qi Liu, Wei Wu, Enhong Chen

Figure 1 for Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment Analysis

Figure 2 for Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment Analysis

Figure 3 for Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment Analysis

Figure 4 for Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment Analysis

Aspect-based sentiment analysis (ABSA) predicts sentiment polarity towards a specific aspect in the given sentence. While pre-trained language models such as BERT have achieved great success, incorporating dynamic semantic changes into ABSA remains challenging. To this end, in this paper, we propose to address this problem by Dynamic Re-weighting BERT (DR-BERT), a novel method designed to learn dynamic aspect-oriented semantics for ABSA. Specifically, we first take the Stack-BERT layers as a primary encoder to grasp the overall semantic of the sentence and then fine-tune it by incorporating a lightweight Dynamic Re-weighting Adapter (DRA). Note that the DRA can pay close attention to a small region of the sentences at each step and re-weigh the vitally important words for better aspect-aware sentiment understanding. Finally, experimental results on three benchmark datasets demonstrate the effectiveness and the rationality of our proposed model and provide good interpretable insights for future semantic modeling.

* 14 pages, 6 figures

Via

Access Paper or Ask Questions

Learning the Implicit Semantic Representation on Graph-Structured Data

Jan 16, 2021
Likang Wu, Zhi Li, Hongke Zhao, Qi Liu, Jun Wang, Mengdi Zhang, Enhong Chen

Figure 1 for Learning the Implicit Semantic Representation on Graph-Structured Data

Figure 2 for Learning the Implicit Semantic Representation on Graph-Structured Data

Figure 3 for Learning the Implicit Semantic Representation on Graph-Structured Data

Figure 4 for Learning the Implicit Semantic Representation on Graph-Structured Data

Existing representation learning methods in graph convolutional networks are mainly designed by describing the neighborhood of each node as a perceptual whole, while the implicit semantic associations behind highly complex interactions of graphs are largely unexploited. In this paper, we propose a Semantic Graph Convolutional Networks (SGCN) that explores the implicit semantics by learning latent semantic-paths in graphs. In previous work, there are explorations of graph semantics via meta-paths. However, these methods mainly rely on explicit heterogeneous information that is hard to be obtained in a large amount of graph-structured data. SGCN first breaks through this restriction via leveraging the semantic-paths dynamically and automatically during the node aggregating process. To evaluate our idea, we conduct sufficient experiments on several standard datasets, and the empirical results show the superior performance of our model.

* 16 pages, DASFAA 2021

Via

Access Paper or Ask Questions

Polarized Reflection Removal with Perfect Alignment in the Wild

Mar 28, 2020
Chenyang Lei, Xuhua Huang, Mengdi Zhang, Qiong Yan, Wenxiu Sun, Qifeng Chen

Figure 1 for Polarized Reflection Removal with Perfect Alignment in the Wild

Figure 2 for Polarized Reflection Removal with Perfect Alignment in the Wild

Figure 3 for Polarized Reflection Removal with Perfect Alignment in the Wild

Figure 4 for Polarized Reflection Removal with Perfect Alignment in the Wild

We present a novel formulation to removing reflection from polarized images in the wild. We first identify the misalignment issues of existing reflection removal datasets where the collected reflection-free images are not perfectly aligned with input mixed images due to glass refraction. Then we build a new dataset with more than 100 types of glass in which obtained transmission images are perfectly aligned with input mixed images. Second, capitalizing on the special relationship between reflection and polarized light, we propose a polarized reflection removal model with a two-stage architecture. In addition, we design a novel perceptual NCC loss that can improve the performance of reflection removal and general image decomposition tasks. We conduct extensive experiments, and results suggest that our model outperforms state-of-the-art methods on reflection removal.

* CVPR2020

Via

Access Paper or Ask Questions

Knowledge Graph Convolutional Networks for Recommender Systems with Label Smoothness Regularization

May 11, 2019
Hongwei Wang, Fuzheng Zhang, Mengdi Zhang, Jure Leskovec, Miao Zhao, Wenjie Li, Zhongyuan Wang

Figure 1 for Knowledge Graph Convolutional Networks for Recommender Systems with Label Smoothness Regularization

Figure 2 for Knowledge Graph Convolutional Networks for Recommender Systems with Label Smoothness Regularization

Figure 3 for Knowledge Graph Convolutional Networks for Recommender Systems with Label Smoothness Regularization

Figure 4 for Knowledge Graph Convolutional Networks for Recommender Systems with Label Smoothness Regularization

Knowledge graphs capture interlinked information between entities and they represent an attractive source of structured information that can be harnessed for recommender systems. However, existing recommender engines use knowledge graphs by manually designing features, do not allow for end-to-end training, or provide poor scalability. Here we propose Knowledge Graph Convolutional Networks (KGCN), an end-to-end trainable framework that harnesses item relationships captured by the knowledge graph to provide better recommendations. Conceptually, KGCN computes user-specific item embeddings by first applying a trainable function that identifies important knowledge graph relations for a given user and then transforming the knowledge graph into a user-specific weighted graph. Then, KGCN applies a graph convolutional neural network that computes an embedding of an item node by propagating and aggregating knowledge graph neighborhood information. Moreover, to provide better inductive bias KGCN uses label smoothness (LS), which provides regularization over edge weights and we prove that it is equivalent to label propagation scheme on a graph. Finally, We unify KGCN and LS regularization, and present a scalable minibatch implementation for KGCN-LS model. Experiments show that KGCN-LS outperforms strong baselines in four datasets. KGCN-LS also achieves great performance in sparse scenarios and is highly scalable with respect to the knowledge graph size.

* KDD 2019 research track oral

Via

Access Paper or Ask Questions

Fostering User Engagement: Rhetorical Devices for Applause Generation Learnt from TED Talks

Apr 17, 2017
Zhe Liu, Anbang Xu, Mengdi Zhang, Jalal Mahmud, Vibha Sinha

Figure 1 for Fostering User Engagement: Rhetorical Devices for Applause Generation Learnt from TED Talks

Figure 2 for Fostering User Engagement: Rhetorical Devices for Applause Generation Learnt from TED Talks

Figure 3 for Fostering User Engagement: Rhetorical Devices for Applause Generation Learnt from TED Talks

Figure 4 for Fostering User Engagement: Rhetorical Devices for Applause Generation Learnt from TED Talks

One problem that every presenter faces when delivering a public discourse is how to hold the listeners' attentions or to keep them involved. Therefore, many studies in conversation analysis work on this issue and suggest qualitatively con-structions that can effectively lead to audience's applause. To investigate these proposals quantitatively, in this study we an-alyze the transcripts of 2,135 TED Talks, with a particular fo-cus on the rhetorical devices that are used by the presenters for applause elicitation. Through conducting regression anal-ysis, we identify and interpret 24 rhetorical devices as triggers of audience applauding. We further build models that can rec-ognize applause-evoking sentences and conclude this work with potential implications.

Via

Access Paper or Ask Questions