Can Xu

LFGCF: Light Folksonomy Graph Collaborative Filtering for Tag-Aware Recommendation

Aug 06, 2022

Yin Zhang, Can Xu, XianJun Wu, Yan Zhang, LiGang Dong, Weigang Wang

Abstract:Tag-aware recommendation is the task of predicting a personalized list of items for a user from their tagging behaviors. It is crucial for many applications with tagging capabilities, such as last.fm or movielens. Recently, many efforts have been devoted to improving tag-aware recommendation systems (TRS) with Graph Convolutional Networks (GCN), which have become the new state of the art for general recommendation. However, some solutions are directly inherited from GCN without justification, which makes it difficult to alleviate the sparsity, ambiguity, and redundancy issues introduced by tags, thus adding to the difficulty of training and degrading recommendation performance. In this work, we aim to simplify the design of GCN to make it more concise for TRS. We propose a novel tag-aware recommendation model named Light Folksonomy Graph Collaborative Filtering (LFGCF), which only includes the essential GCN components. Specifically, LFGCF first constructs Folksonomy Graphs from the records of users assigning tags and items getting tagged. Then we leverage the simple design of aggregation to learn the high-order representations on Folksonomy Graphs and use the weighted sum of the embeddings learned at several layers for information updating. We share tag embeddings to bridge the information gap between users and items. Besides, a regularization function named TransRT is proposed to better depict user preferences and item features. Extensive hyperparameter experiments and ablation studies on three real-world datasets show that LFGCF uses fewer parameters and significantly outperforms most baselines for tag-aware top-N recommendation.
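The "simple aggregation plus weighted sum over layers" step in the abstract above can be illustrated with a small sketch. This is not the authors' implementation: the symmetric degree normalization, the function names `normalized_adjacency` and `light_propagate`, and the toy user-tag matrix are all assumptions, following the general pattern of light graph convolution (neighbor averaging with no feature transform or nonlinearity).

```python
import numpy as np

def normalized_adjacency(interactions):
    """Build a symmetric-normalized adjacency for a bipartite graph.

    `interactions` is a (n_left, n_right) 0/1 matrix, e.g. users x tags.
    The D^-1/2 A D^-1/2 normalization is an assumption, not taken from the paper.
    """
    n_left, n_right = interactions.shape
    n = n_left + n_right
    adj = np.zeros((n, n))
    adj[:n_left, n_left:] = interactions
    adj[n_left:, :n_left] = interactions.T
    deg = adj.sum(axis=1)
    inv_sqrt = np.zeros_like(deg)
    nonzero = deg > 0
    inv_sqrt[nonzero] = deg[nonzero] ** -0.5  # isolated nodes keep weight 0
    return inv_sqrt[:, None] * adj * inv_sqrt[None, :]

def light_propagate(adj_norm, emb0, layer_weights):
    """Return the weighted sum of embeddings from layers 0..K.

    Each layer is plain neighbor aggregation: E^(k+1) = A_norm @ E^(k).
    """
    out = layer_weights[0] * emb0
    emb = emb0
    for weight in layer_weights[1:]:
        emb = adj_norm @ emb
        out = out + weight * emb
    return out

# Toy example: 2 users x 2 tags (hypothetical data).
user_tag = np.array([[1.0, 0.0],
                     [1.0, 1.0]])
adj_norm = normalized_adjacency(user_tag)
emb0 = np.random.default_rng(0).normal(size=(4, 3))
final = light_propagate(adj_norm, emb0, [1 / 3, 1 / 3, 1 / 3])
```

In this sketch the first two rows of `final` are user embeddings and the last two are tag embeddings; a second graph of the same form for items and tags could reuse the tag rows, which is one way to read the abstract's "shared tag embeddings".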

KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Few-Shot NLP

Jun 21, 2022

Yufei Wang, Jiayi Zheng, Can Xu, Xiubo Geng, Tao Shen, Chongyang Tao, Daxin Jiang

Abstract:This paper focuses on text data augmentation for few-shot NLP tasks. The existing data augmentation algorithms either leverage task-independent heuristic rules (e.g., Synonym Replacement) or fine-tune general-purpose pre-trained language models (e.g., GPT2) using a small training set to produce new synthetic data. Consequently, these methods have trivial task-specific knowledge and are limited to yielding low-quality synthetic data for weak baselines in simple tasks. To combat this issue, we propose the Knowledge Mixture Data Augmentation Model (KnowDA): an encoder-decoder LM pretrained on a mixture of diverse NLP tasks using Knowledge Mixture Training (KoMT). KoMT is a training procedure that reformulates input examples from various heterogeneous NLP tasks into a unified text-to-text format and employs denoising objectives at different granularities to learn to generate partial or complete samples. With the aid of KoMT, KnowDA can implicitly combine the required task-specific knowledge from the learned mixture of tasks and quickly grasp the inherent synthesis law of the target task through a few given instances. To the best of our knowledge, this is the first attempt to scale the number of tasks to 100+ in multi-task co-training for data augmentation. Extensive experiments show that i) KnowDA successfully improves the performance of Albert and Deberta by a large margin on the FewGLUE benchmark, outperforming previous state-of-the-art data augmentation baselines; ii) KnowDA can also improve model performance on few-shot NER tasks, a held-out task type not included in KoMT.

Towards Robust Ranker for Text Retrieval

Jun 16, 2022

Yucheng Zhou, Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Guodong Long, Binxing Jiao, Daxin Jiang

Abstract:A ranker plays an indispensable role in the de facto 'retrieval & rerank' pipeline, but its training still lags behind -- learning from moderate negatives and/or serving as an auxiliary module for a retriever. In this work, we first identify two major barriers to a robust ranker, i.e., inherent label noise caused by a well-trained retriever and non-ideal negatives sampled for a highly capable ranker. We therefore propose using multiple retrievers as negative generators to improve the ranker's robustness, where i) involving extensive out-of-distribution label noise renders the ranker robust against each noise distribution, and ii) diverse hard negatives from a joint distribution are relatively close to the ranker's negative distribution, leading to more challenging and thus more effective training. To evaluate our robust ranker (dubbed R$^2$anker), we conduct experiments in various settings on the popular passage retrieval benchmark, including BM25-reranking, full-ranking, retriever distillation, etc. The empirical results verify the new state-of-the-art effectiveness of our model.

* 11 pages of main content, 4 tables, 3 figures

UnifieR: A Unified Retriever for Large-Scale Retrieval

May 23, 2022

Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Kai Zhang, Daxin Jiang

Abstract:Large-scale retrieval aims to recall relevant documents from a huge collection given a query. It relies on representation learning to embed documents and queries into a common semantic encoding space. According to the encoding space, recent retrieval methods based on pre-trained language models (PLM) can be coarsely categorized into either dense-vector or lexicon-based paradigms. These two paradigms unveil the PLMs' representation capability at different granularities, i.e., global sequence-level compression and local word-level contexts, respectively. Inspired by their complementary global-local contextualization and distinct representing views, we propose a new learning framework, UnifieR, which unifies dense-vector and lexicon-based retrieval in one model with a dual-representing capability. Experiments on passage retrieval benchmarks verify its effectiveness in both paradigms. A uni-retrieval scheme is further presented with even better retrieval quality. Lastly, we evaluate the model on the BEIR benchmark to verify its transferability.

* 20 pages, 6 figures, 11 tables

Stylized Knowledge-Grounded Dialogue Generation via Disentangled Template Rewriting

Apr 12, 2022

Qingfeng Sun, Can Xu, Huang Hu, Yujing Wang, Jian Miao, Xiubo Geng, Yining Chen, Fei Xu, Daxin Jiang

Abstract:Current Knowledge-Grounded Dialogue Generation (KDG) models specialize in producing rational and factual responses. However, to establish long-term relationships with users, the KDG model needs the capability to generate responses in a desired style or attribute. Thus, we study a new problem: Stylized Knowledge-Grounded Dialogue Generation (SKDG). It presents two challenges: (1) how to train an SKDG model when no <context, knowledge, stylized response> triples are available, and (2) how to cohere with the context and preserve the knowledge when generating a stylized response. In this paper, we propose a novel disentangled template rewriting (DTR) method which generates responses by combining disentangled style templates (from a monolingual stylized corpus) and content templates (from a KDG corpus). The entire framework is end-to-end differentiable and learned without supervision. Extensive experiments on two benchmarks indicate that DTR achieves a significant improvement on all evaluation metrics compared with previous state-of-the-art stylized dialogue generation methods. Besides, DTR achieves performance comparable to state-of-the-art KDG methods in the standard KDG evaluation setting.

* Accepted to NAACL 2022 Main Conference

PromDA: Prompt-based Data Augmentation for Low-Resource NLU Tasks

Mar 17, 2022

Yufei Wang, Can Xu, Qingfeng Sun, Huang Hu, Chongyang Tao, Xiubo Geng, Daxin Jiang

Abstract:This paper focuses on data augmentation for low-resource Natural Language Understanding (NLU) tasks. We propose the Prompt-based Data Augmentation model (PromDA), which only trains a small-scale Soft Prompt (i.e., a set of trainable vectors) in frozen Pre-trained Language Models (PLMs). This avoids human effort in collecting unlabeled in-domain data and maintains the quality of the generated synthetic data. In addition, PromDA generates synthetic data via two different views and filters out low-quality data using NLU models. Experiments on four benchmarks show that synthetic data produced by PromDA successfully boosts the performance of NLU models, which consistently outperform several competitive baseline models, including a state-of-the-art semi-supervised model using unlabeled in-domain data. The synthetic data from PromDA are also complementary to unlabeled in-domain data. The NLU models can be further improved when the two are combined for training.

* Accepted to ACL 2022 Main Conference, Camera-Ready Version

TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge

Mar 16, 2022

Chao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang

Abstract:Generating natural and informative texts has been a long-standing problem in NLP. Much effort has been dedicated to incorporating various kinds of open-world knowledge, such as knowledge graphs or wiki pages, into pre-trained language models (PLMs). However, their ability to access and manipulate task-specific knowledge is still limited on downstream tasks, as this type of knowledge is usually not well covered in PLMs and is hard to acquire. To address the problem, we propose augmenting TExt Generation via Task-specific and Open-world Knowledge (TegTok) in a unified framework. Our model selects knowledge entries from two types of knowledge sources through dense retrieval and then injects them into the input encoding and output decoding stages, respectively, on the basis of PLMs. With the help of these two types of knowledge, our model can learn what and how to generate. Experiments on two text generation tasks, dialogue generation and question generation, and on two datasets show that our method achieves better performance than various baseline models.

* Accepted by Findings of ACL 2022

PCL: Peer-Contrastive Learning with Diverse Augmentations for Unsupervised Sentence Embeddings

Jan 28, 2022

Qiyu Wu, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Daxin Jiang

Abstract:Learning sentence embeddings in an unsupervised manner is fundamental in natural language processing. Recent common practice is to couple pre-trained language models with unsupervised contrastive learning, whose success relies on augmenting a sentence with a semantically-close positive instance to construct contrastive pairs. Nonetheless, existing approaches usually depend on a mono-augmenting strategy, which causes learning shortcuts towards the augmenting biases and thus corrupts the quality of sentence embeddings. A straightforward solution is resorting to more diverse positives from a multi-augmenting strategy, while an open question remains about how to unsupervisedly learn from the diverse positives but with uneven augmenting qualities in the text field. As one answer, we propose a novel Peer-Contrastive Learning (PCL) with diverse augmentations. PCL constructs diverse contrastive positives and negatives at the group level for unsupervised sentence embeddings. PCL can perform peer-positive contrast as well as peer-network cooperation, which offers an inherent anti-bias ability and an effective way to learn from diverse augmentations. Experiments on STS benchmarks verify the effectiveness of our PCL against its competitors in unsupervised sentence embeddings.

Recency Dropout for Recurrent Recommender Systems

Jan 26, 2022

Bo Chang, Can Xu, Matthieu Lê, Jingchen Feng, Ya Le, Sriraj Badam, Ed Chi, Minmin Chen

Abstract:Recurrent recommender systems have been successful in capturing the temporal dynamics in users' activity trajectories. However, recurrent neural networks (RNNs) are known to have difficulty learning long-term dependencies. As a consequence, RNN-based recommender systems tend to overly focus on short-term user interests. This is referred to as the recency bias, which could negatively affect the long-term user experience as well as the health of the ecosystem. In this paper, we introduce the recency dropout technique, a simple yet effective data augmentation technique to alleviate the recency bias in recurrent recommender systems. We demonstrate the effectiveness of recency dropout in various experimental settings including a simulation study, offline experiments, as well as live experiments on a large-scale industrial recommendation platform.
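The abstract above describes recency dropout only as a data augmentation technique and does not spell out the procedure. One plausible reading, sketched below purely as an assumption, is to randomly drop a number of the most recent items from a user's history during training so the model cannot rely only on the latest activity. The function name `recency_dropout` and the parameter `max_drop` are hypothetical.

```python
import random

def recency_dropout(history, max_drop, rng=None):
    """Return a copy of `history` with the k most recent items removed.

    k is drawn uniformly from 0..max_drop (capped so at least one item
    remains). This sampling scheme is an assumption for illustration.
    """
    rng = rng or random
    limit = min(max_drop, len(history) - 1)
    if limit <= 0:
        return list(history)
    k = rng.randint(0, limit)  # inclusive on both ends
    return list(history[:len(history) - k])

# Hypothetical user history, oldest item first.
history = ["item_a", "item_b", "item_c", "item_d", "item_e"]
augmented = recency_dropout(history, max_drop=3, rng=random.Random(0))
```

The augmented sequence is always a prefix of the original history, so the training target (the next item) can be left unchanged while the most recent context is withheld.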

Small Changes Make Big Differences: Improving Multi-turn Response Selection in Dialogue Systems via Fine-Grained Contrastive Learning

Nov 25, 2021

Yuntao Li, Can Xu, Huang Hu, Lei Sha, Yan Zhang, Daxin Jiang

Abstract:Retrieval-based dialogue response selection aims to find a proper response from a candidate set given a multi-turn context. Methods based on pre-trained language models (PLMs) have yielded significant improvements on this task. The sequence representation plays a key role in learning the matching degree between the dialogue context and the response. However, we observe that different context-response pairs sharing the same context always have a greater similarity in the sequence representations calculated by PLMs, which makes it hard to distinguish positive responses from negative ones. Motivated by this, we propose a novel Fine-Grained Contrastive (FGC) learning method for the response selection task based on PLMs. This FGC learning strategy helps PLMs generate more distinguishable matching representations of each dialogue at fine grains, and thereby make better predictions when choosing positive responses. Empirical studies on two benchmark datasets demonstrate that the proposed FGC learning method can generally and significantly improve the performance of existing PLM-based matching models.
