University of Illinois at Chicago
Abstract:The field of Recommender Systems (RecSys) has been extensively studied to enhance accuracy by leveraging users' historical interactions. However, this persistent pursuit of accuracy frequently comes at the cost of diversity, culminating in the well-recognized "echo chamber" phenomenon. Diversified RecSys has emerged as a countermeasure, placing diversity on par with accuracy and garnering noteworthy attention from both academia and industry. This research explores diversified RecSys in the context of knowledge graphs (KGs). KGs act as repositories of interconnected information concerning entities and items, offering a promising avenue to amplify recommendation diversity through the incorporation of contextual information. Our contributions include two novel metrics, Entity Coverage and Relation Coverage, which effectively quantify diversity within the KG domain. Additionally, we introduce the Diversified Embedding Learning (DEL) module, designed to produce user representations with an innate awareness of diversity. In tandem with this, we introduce a novel technique named Conditional Alignment and Uniformity (CAU), which encodes KG item embeddings while preserving contextual integrity. Collectively, these contributions constitute a substantial step toward improving recommendation diversity in KG-informed RecSys.
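
As a concrete illustration of the proposed coverage metrics, the sketch below computes Entity Coverage and Relation Coverage for a recommendation list, assuming each item is linked to a set of (relation, entity) pairs in the KG. The data layout and function names are our own illustrative assumptions, not the paper's implementation.

# Illustrative sketch: Entity Coverage and Relation Coverage of a
# recommendation list, assuming each recommended item maps to a set of
# (relation, entity) pairs in the knowledge graph. Names are hypothetical.

def entity_coverage(rec_items, kg_triples):
    """Fraction of distinct KG entities reachable from the recommended items."""
    covered = {e for item in rec_items for (_, e) in kg_triples.get(item, [])}
    all_entities = {e for pairs in kg_triples.values() for (_, e) in pairs}
    return len(covered) / max(len(all_entities), 1)

def relation_coverage(rec_items, kg_triples):
    """Fraction of distinct KG relation types reachable from the recommended items."""
    covered = {r for item in rec_items for (r, _) in kg_triples.get(item, [])}
    all_relations = {r for pairs in kg_triples.values() for (r, _) in pairs}
    return len(covered) / max(len(all_relations), 1)

# Toy example: a more diverse list covers more entities and relation types.
kg = {"movie_a": [("genre", "sci-fi"), ("director", "d1")],
      "movie_b": [("genre", "sci-fi")],
      "movie_c": [("genre", "drama"), ("actor", "a1")]}
print(entity_coverage(["movie_a", "movie_b"], kg))    # 2 of 4 entities -> 0.5
print(relation_coverage(["movie_a", "movie_b"], kg))  # 2 of 3 relations
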
Abstract:Although pretraining has garnered significant attention and popularity in recent years, its application in graph-based recommender systems is relatively limited. It is challenging to exploit prior knowledge through pretraining on widely used ID-dependent datasets: on one hand, user-item interaction history in one dataset can hardly be transferred to other datasets through pretraining, since IDs differ; on the other hand, pretraining and finetuning on the same dataset carries a high risk of overfitting. In this paper, we propose a novel multitask pretraining framework named Unified Pretraining for Recommendation via Task Hypergraphs (UPRTH). To obtain a unified learning pattern that handles the diverse requirements and nuances of various pretext tasks, we design task hypergraphs that generalize pretext tasks to hyperedge prediction. A novel transitional attention layer is devised to discriminatively learn the relevance between each pretext task and recommendation. Experimental results on three benchmark datasets verify the superiority of UPRTH, and additional detailed investigations demonstrate the effectiveness of the proposed framework.
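
To make the hyperedge-prediction view of pretext tasks concrete, here is a minimal sketch in which a candidate hyperedge (a set of nodes) is scored from the aggregated embeddings of its members. The aggregation and scorer are illustrative assumptions, not UPRTH's actual architecture.

# Minimal sketch of casting a pretext task as hyperedge prediction: a
# candidate hyperedge is scored from the mean-pooled embeddings of its member
# nodes. This is one plausible reading of the idea, not UPRTH's code.
import torch
import torch.nn as nn

class HyperedgeScorer(nn.Module):
    def __init__(self, num_nodes, dim=32):
        super().__init__()
        self.emb = nn.Embedding(num_nodes, dim)
        self.mlp = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, 1))

    def forward(self, node_ids):            # node_ids: LongTensor of member nodes
        h = self.emb(node_ids).mean(dim=0)  # permutation-invariant aggregation
        return torch.sigmoid(self.mlp(h))   # probability the hyperedge exists

scorer = HyperedgeScorer(num_nodes=100)
print(scorer(torch.tensor([3, 17, 42])))    # score for one candidate hyperedge
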
Abstract:Emerging as fundamental building blocks for diverse artificial intelligence applications, foundation models have achieved notable success across natural language processing and many other domains. In parallel, graph machine learning has witnessed a transformative shift, with shallow methods giving way to deep learning approaches. The emergence and homogenization capabilities of foundation models have piqued the interest of graph machine learning researchers, sparking discussions about developing the next graph learning paradigm: one that is pre-trained on broad graph data and can be adapted to a wide range of downstream graph tasks. However, there is currently no clear definition or systematic analysis of this type of work. In this article, we propose the concept of graph foundation models (GFMs) and provide the first comprehensive elucidation of their key characteristics and technologies. We then categorize existing works towards GFMs into three categories based on their reliance on graph neural networks and large language models. Beyond providing a comprehensive overview of the current landscape of graph foundation models, this article also discusses potential research directions for this evolving field.




Abstract:Traditional recommender systems have heavily relied on identity representations (IDs) to model users and items, while the ascendancy of pre-trained language model (PLM) encoders has enriched the modeling of contextual item descriptions. However, PLMs, although effective in few-shot, zero-shot, and unified modeling scenarios, often neglect the crucial collaborative filtering signal. This neglect gives rise to two pressing challenges: (1) Collaborative Contextualization, the seamless integration of collaborative signals with contextual representations, and (2) the need to bridge the representation gap between ID-based and contextual representations while preserving contextual semantics. In this paper, we propose CollabContext, a novel model that combines collaborative filtering signals with contextual representations and aligns these representations within the contextual space, preserving essential contextual semantics. Experimental results on three real-world datasets demonstrate substantial improvements. Leveraging collaborative contextualization, CollabContext can also be applied effectively to cold-start scenarios, achieving remarkable gains in recommendation performance. The code will be released upon paper acceptance.
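
A minimal sketch of one plausible reading of collaborative contextualization: an ID embedding is injected into a projected PLM text embedding, and a cosine alignment term keeps the fused vector close to the contextual space. The module layout and loss weighting are assumptions, not CollabContext's released code.

# Illustrative sketch: fuse a collaborative (ID) embedding with a PLM text
# embedding and align the fused vector to the contextual space. Hypothetical.
import torch
import torch.nn as nn
import torch.nn.functional as F

class CollabContextItem(nn.Module):
    def __init__(self, num_items, text_dim, dim=64):
        super().__init__()
        self.id_emb = nn.Embedding(num_items, dim)
        self.proj = nn.Linear(text_dim, dim)   # maps PLM output into model space

    def forward(self, item_ids, text_emb):
        ctx = self.proj(text_emb)              # contextual representation
        fused = ctx + self.id_emb(item_ids)    # inject collaborative signal
        # alignment term: keep the fused vector close to the contextual one
        align = 1 - F.cosine_similarity(fused, ctx.detach(), dim=-1).mean()
        return fused, align

model = CollabContextItem(num_items=1000, text_dim=768)
ids = torch.tensor([1, 2, 3])
txt = torch.randn(3, 768)                      # stand-in for PLM encoder output
fused, align_loss = model(ids, txt)
print(fused.shape, float(align_loss))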




Abstract:Product attribute value extraction plays an important role in many real-world e-Commerce applications such as product search and recommendation. Previous methods treat it as a sequence labeling task, which additionally requires annotation of the positions of values in the product text. This limits their applicability to real-world scenarios in which only attribute values are weakly annotated for each product, without their positions. Moreover, these methods use only product text (i.e., product title and description) and do not consider the semantic connection between the multiple attribute values of a given product and its text, which can help attribute value extraction. In this paper, we reformulate the task as multi-label classification, which can be applied to real-world scenarios in which only annotations of attribute values, not their positions, are available for training. We propose a classification model with semantic matching and negative label sampling for attribute value extraction. Semantic matching aims to capture the semantic interactions between the attribute values of a given product and its text. Negative label sampling aims to enhance the model's ability to distinguish similar values belonging to the same attribute. Experimental results on three subsets of a large real-world e-Commerce dataset demonstrate the effectiveness and superiority of our proposed model.
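
The sketch below illustrates the multi-label reformulation: every candidate attribute value is scored against the product text, and training uses BCE over the positive values plus negatives sampled from the same attribute. The encoder stand-in and sampling scheme are illustrative assumptions, not the paper's model.

# Sketch: attribute value extraction as multi-label classification with
# semantic matching (dot product) and negative label sampling. Hypothetical.
import random
import torch
import torch.nn as nn

class ValueClassifier(nn.Module):
    def __init__(self, num_values, text_dim=128):
        super().__init__()
        self.value_emb = nn.Embedding(num_values, text_dim)

    def forward(self, text_vec, value_ids):
        # semantic matching between the product text and each value embedding
        return (self.value_emb(value_ids) * text_vec).sum(-1)

def training_step(model, text_vec, pos_ids, same_attr_pool, k_neg=4):
    # sample negatives from the same attribute to sharpen fine distinctions
    neg_ids = random.sample([v for v in same_attr_pool if v not in pos_ids], k_neg)
    ids = torch.tensor(pos_ids + neg_ids)
    labels = torch.tensor([1.0] * len(pos_ids) + [0.0] * k_neg)
    logits = model(text_vec, ids)
    return nn.functional.binary_cross_entropy_with_logits(logits, labels)

model = ValueClassifier(num_values=500)
loss = training_step(model, torch.randn(128), pos_ids=[7], same_attr_pool=list(range(50)))
print(float(loss))
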
Abstract:Large language models (LLMs) have recently driven striking performance improvements across a range of natural language processing tasks. The factual knowledge acquired during pretraining and instruction tuning can be useful in various downstream tasks, such as question answering and language generation. Unlike conventional knowledge bases (KBs) that explicitly store factual knowledge, LLMs store facts implicitly in their parameters. Content generated by LLMs can therefore exhibit inaccuracies or deviations from the truth, because facts can be incorrectly induced or become obsolete over time. To this end, we comprehensively evaluate the extent and scope of factual knowledge within LLMs by designing the benchmark Pinocchio. Pinocchio contains 20K diverse factual questions that span different sources, timelines, domains, regions, and languages. Furthermore, we investigate whether LLMs can compose multiple facts, update factual knowledge temporally, reason over multiple pieces of facts, identify subtle factual differences, and resist adversarial examples. Extensive experiments on LLMs of different sizes and types show that existing LLMs still lack factual knowledge and suffer from various spurious correlations. We believe this is a critical bottleneck for realizing trustworthy artificial intelligence. The Pinocchio dataset and our code will be made publicly available.
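
Since the benchmark slices questions along several facets (source, timeline, domain, region, language), a natural evaluation is per-facet accuracy rather than a single aggregate number. Below is a hypothetical sketch of such an aggregation; the field names and predictor interface are assumptions, not the Pinocchio code.

# Hypothetical sketch: aggregate accuracy per facet so weaknesses in specific
# slices (e.g., a domain or timeline) remain visible. Field names assumed.
from collections import defaultdict

def facet_accuracy(examples, predict, facet="domain"):
    hits, totals = defaultdict(int), defaultdict(int)
    for ex in examples:
        totals[ex[facet]] += 1
        hits[ex[facet]] += int(predict(ex["question"]) == ex["answer"])
    return {f: hits[f] / totals[f] for f in totals}

data = [{"question": "Capital of France?", "answer": "Paris", "domain": "geography"},
        {"question": "2+2?", "answer": "4", "domain": "math"}]
print(facet_accuracy(data, predict=lambda q: "Paris"))  # per-domain accuracy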




Abstract:High-utility sequential rule mining (HUSRM) is a knowledge discovery method that reveals associations between events in sequences. Recently, abundant methods have been proposed to discover high-utility sequential rules, but existing methods all target point-based sequences. However, interval events, which persist over a period of time, are common in practice. Traditional knowledge discovery tasks on interval-event sequences mainly focus on pattern discovery, yet patterns cannot reveal the correlations between interval events well. Moreover, existing HUSRM algorithms cannot be applied directly to interval-event sequences, since the relations in interval-event sequences are far more intricate than those in point-based sequences. In this work, we propose a utility-driven interval rule mining (UIRMiner) algorithm that extracts all utility-driven interval rules (UIRs) from an interval-event sequence database. In UIRMiner, we first introduce a numeric encoding relation representation, which saves substantial time in relation computation and space in relation storage. Furthermore, to shrink the search space, we propose a complement pruning strategy that incorporates the utility upper bound with the relation. Finally, extensive experiments on both real-world and synthetic datasets verify that UIRMiner is an effective and efficient algorithm.
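
To illustrate why a numeric relation encoding is cheap to compute and store, the sketch below encodes the relation between two interval events as a small tuple of endpoint comparisons, in the spirit of reducing Allen-style temporal relations to arithmetic on endpoints. The specific comparison triple is our assumption, not UIRMiner's actual encoding.

# Sketch: numeric encoding of the relation between two intervals via endpoint
# comparisons, so relations can be computed and stored cheaply. Hypothetical.

def sign(x):
    return (x > 0) - (x < 0)   # -1, 0, or 1

def encode_relation(a, b):
    """Encode the relation between intervals a=(s1,e1) and b=(s2,e2) as the
    triple (sign(s1-s2), sign(e1-s2), sign(e1-e2))."""
    (s1, e1), (s2, e2) = a, b
    return (sign(s1 - s2), sign(e1 - s2), sign(e1 - e2))

print(encode_relation((1, 3), (2, 5)))  # overlaps: (-1, 1, -1)
print(encode_relation((1, 2), (2, 5)))  # meets:    (-1, 0, -1)
print(encode_relation((1, 5), (2, 3)))  # contains: (-1, 1, 1)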




Abstract:Graph-based semi-supervised learning (GSSL) has long been a hot research topic. Traditional methods are generally shallow learners based on the cluster assumption. More recently, graph convolutional networks (GCNs) have become the predominant techniques owing to their promising performance. In this paper, we theoretically discuss the relationship between these two types of methods in a unified optimization framework. One of the most intriguing findings is that, unlike traditional methods, typical GCNs may not jointly consider the graph structure and label information at each layer. Motivated by this, we further propose three simple but powerful graph convolution methods. The first is a supervised method, OGC, which guides the graph convolution process with labels. The other two are unsupervised methods, GGC and its multi-scale version GGCM, both of which aim to preserve graph structure information during the convolution process. Finally, we conduct extensive experiments to show the effectiveness of our methods.
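
A minimal sketch of the label-guided idea, under our own assumptions about the update: alternate a smoothing step over the normalized adjacency with a supervised correction that pulls labeled nodes toward their labels. The step sizes and exact update rule are illustrative, not OGC's published formulation.

# Sketch: label-guided graph convolution, alternating graph smoothing with a
# supervised correction on labeled nodes. All details are hypothetical.
import numpy as np

def label_guided_convolution(A_hat, X, Y, labeled_mask, steps=10, beta=0.5):
    """A_hat: normalized adjacency (n x n); X: features (n x d);
    Y: one-hot labels (n x c, zero rows for unlabeled); labeled_mask: (n,)."""
    W = np.linalg.lstsq(X[labeled_mask], Y[labeled_mask], rcond=None)[0]
    for _ in range(steps):
        X = A_hat @ X                                   # graph smoothing
        residual = (X @ W - Y) * labeled_mask[:, None]  # supervised correction
        X = X - beta * residual @ W.T
    return X @ W                                        # class scores

n, d, c = 6, 4, 2
A_hat = np.eye(n)                # stand-in for a normalized adjacency matrix
X, Y = np.random.randn(n, d), np.zeros((n, c))
mask = np.zeros(n, dtype=bool)
mask[:2] = True
Y[0, 0] = Y[1, 1] = 1
print(label_guided_convolution(A_hat, X, Y, mask).shape)  # (6, 2)
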
Abstract:Question answering on tabular data (a.k.a. TableQA), which aims at generating answers to questions grounded in a provided table, has gained significant attention recently. Prior work primarily produces concise factual responses by extracting information from individual or limited table cells, lacking the ability to reason across diverse table cells. Yet the realm of free-form TableQA, which demands intricate strategies for selecting relevant table cells and sophisticated integration and inference over discrete data fragments, remains mostly unexplored. To this end, this paper proposes a generalized three-stage approach, called TAG-QA, consisting of table-to-graph conversion and cell localization, external knowledge retrieval, and the fusion of table and text, to address the challenge of inferring long free-form answers in generative TableQA. In particular, TAG-QA (1) locates relevant table cells using a graph neural network to gather intersecting cells between relevant rows and columns, (2) leverages external knowledge from Wikipedia, and (3) generates answers by integrating both tabular data and natural linguistic information. Experiments showcase the superior capability of TAG-QA in generating sentences that are both faithful and coherent, particularly in comparison with several state-of-the-art baselines. Notably, TAG-QA surpasses the strong pipeline-based baseline TAPAS by 17% and 14% in BLEU-4 and PARENT F-score, respectively, and outperforms the end-to-end model T5 by 16% and 12% on the same metrics.
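
One plausible reading of the table-to-graph stage is sketched below: each cell becomes a node, and cells sharing a row or a column are connected, so a GNN can later locate cells where relevant rows and columns intersect. The construction is purely an assumption for illustration, not TAG-QA's released code.

# Sketch: convert a table to a graph with cell nodes and row/column edges.
# The graph layout is a hypothetical reading of the table-to-graph step.
import networkx as nx

def table_to_graph(table):          # table: list of rows of cell strings
    g = nx.Graph()
    for i, row in enumerate(table):
        for j, cell in enumerate(row):
            g.add_node((i, j), text=cell)
    for i, row in enumerate(table):                      # connect within rows
        for j in range(len(row)):
            for k in range(j + 1, len(row)):
                g.add_edge((i, j), (i, k), kind="row")
    for j in range(len(table[0])):                       # connect within columns
        for i in range(len(table)):
            for k in range(i + 1, len(table)):
                g.add_edge((i, j), (k, j), kind="column")
    return g

g = table_to_graph([["name", "year"], ["Alice", "1990"], ["Bob", "1985"]])
print(g.number_of_nodes(), g.number_of_edges())  # 6 nodes, 9 row/column edges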




Abstract:Named Entity Recognition (NER) aims to extract entity mentions from text and classify them into pre-defined types (e.g., organization or person name). Recently, many works have proposed to formulate NER as a machine reading comprehension problem (also termed MRC-based NER), in which entity recognition is achieved by answering questions formulated for the pre-defined entity types through MRC, based on the contexts. However, these works ignore the label dependencies among entity types, which are critical for precisely recognizing named entities. In this paper, we propose to incorporate the label dependencies among entity types into a multi-task learning framework, termed Multi-NER, for better MRC-based NER. We decompose MRC-based NER into multiple tasks and use a self-attention module to capture the label dependencies. Comprehensive experiments on both nested and flat NER datasets validate the effectiveness of Multi-NER, showing that it achieves better performance on all datasets.
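
A minimal sketch of capturing label dependencies across entity types: each type's query reads a type-specific representation off the sentence, and a self-attention layer lets the types exchange information before per-token scoring. The dimensions and module layout are illustrative assumptions, not Multi-NER's architecture.

# Sketch: self-attention over per-type representations to model label
# dependencies in MRC-style NER. All design details are hypothetical.
import torch
import torch.nn as nn

class LabelDependencyNER(nn.Module):
    def __init__(self, num_types, hidden=64):
        super().__init__()
        self.type_queries = nn.Parameter(torch.randn(num_types, hidden))
        self.cross_attn = nn.MultiheadAttention(hidden, num_heads=4, batch_first=True)
        self.label_attn = nn.MultiheadAttention(hidden, num_heads=4, batch_first=True)

    def forward(self, token_states):             # (batch, seq_len, hidden)
        b = token_states.size(0)
        q = self.type_queries.unsqueeze(0).expand(b, -1, -1)
        # one representation per entity type, read off the sentence
        types, _ = self.cross_attn(q, token_states, token_states)
        # self-attention across types captures label dependencies
        types, _ = self.label_attn(types, types, types)
        # per-type relevance score for every token (e.g., for span heads)
        return torch.einsum("bth,bsh->bts", types, token_states)

model = LabelDependencyNER(num_types=3)
print(model(torch.randn(2, 10, 64)).shape)        # torch.Size([2, 3, 10])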