Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xiaocong Chen

Deep Reinforcement Learning for Dynamic Recommendation with Model-agnostic Counterfactual Policy Synthesis

Aug 10, 2022

Siyu Wang, Xiaocong Chen, Lina Yao, Julian McAuley

Figure 1 for Deep Reinforcement Learning for Dynamic Recommendation with Model-agnostic Counterfactual Policy Synthesis

Figure 2 for Deep Reinforcement Learning for Dynamic Recommendation with Model-agnostic Counterfactual Policy Synthesis

Figure 3 for Deep Reinforcement Learning for Dynamic Recommendation with Model-agnostic Counterfactual Policy Synthesis

Figure 4 for Deep Reinforcement Learning for Dynamic Recommendation with Model-agnostic Counterfactual Policy Synthesis

Abstract:Recent advances in recommender systems have proved the potential of Reinforcement Learning (RL) to handle the dynamic evolution processes between users and recommender systems. However, learning to train an optimal RL agent is generally impractical with commonly sparse user feedback data in the context of recommender systems. To circumvent the lack of interaction of current RL-based recommender systems, we propose to learn a general Model-agnostic Counterfactual Synthesis Policy for counterfactual user interaction data augmentation. The counterfactual synthesis policy aims to synthesise counterfactual states while preserving significant information in the original state relevant to the user's interests, building upon two different training approaches we designed: learning with expert demonstrations and joint training. As a result, the synthesis of each counterfactual data is based on the current recommendation agent interaction with the environment to adapt to users' dynamic interests. We integrate the proposed policy Deep Deterministic Policy Gradient (DDPG), Soft Actor Critic (SAC) and Twin Delayed DDPG in an adaptive pipeline with a recommendation agent that can generate counterfactual data to improve the performance of recommendation. The empirical results on both online simulation and offline datasets demonstrate the effectiveness and generalisation of our counterfactual synthesis policy and verify that it improves the performance of RL recommendation agents.

Via

Access Paper or Ask Questions

Model-agnostic Counterfactual Synthesis Policy for Interactive Recommendation

Apr 01, 2022

Siyu Wang, Xiaocong Chen, Lina Yao

Figure 1 for Model-agnostic Counterfactual Synthesis Policy for Interactive Recommendation

Figure 2 for Model-agnostic Counterfactual Synthesis Policy for Interactive Recommendation

Figure 3 for Model-agnostic Counterfactual Synthesis Policy for Interactive Recommendation

Abstract:Interactive recommendation is able to learn from the interactive processes between users and systems to confront the dynamic interests of users. Recent advances have convinced that the ability of reinforcement learning to handle the dynamic process can be effectively applied in the interactive recommendation. However, the sparsity of interactive data may hamper the performance of the system. We propose to train a Model-agnostic Counterfactual Synthesis Policy to generate counterfactual data and address the data sparsity problem by modelling from observation and counterfactual distribution. The proposed policy can identify and replace the trivial components for any state in the training process with other agents, which can be deployed in any RL-based algorithm. The experimental results demonstrate the effectiveness and generality of our proposed policy.

Via

Access Paper or Ask Questions

Adversarial Robustness of Deep Reinforcement Learning based Dynamic Recommender Systems

Dec 02, 2021

Siyu Wang, Yuanjiang Cao, Xiaocong Chen, Lina Yao, Xianzhi Wang, Quan Z. Sheng

Figure 1 for Adversarial Robustness of Deep Reinforcement Learning based Dynamic Recommender Systems

Figure 2 for Adversarial Robustness of Deep Reinforcement Learning based Dynamic Recommender Systems

Figure 3 for Adversarial Robustness of Deep Reinforcement Learning based Dynamic Recommender Systems

Figure 4 for Adversarial Robustness of Deep Reinforcement Learning based Dynamic Recommender Systems

Abstract:Adversarial attacks, e.g., adversarial perturbations of the input and adversarial samples, pose significant challenges to machine learning and deep learning techniques, including interactive recommendation systems. The latent embedding space of those techniques makes adversarial attacks difficult to detect at an early stage. Recent advance in causality shows that counterfactual can also be considered one of ways to generate the adversarial samples drawn from different distribution as the training samples. We propose to explore adversarial examples and attack agnostic detection on reinforcement learning-based interactive recommendation systems. We first craft different types of adversarial examples by adding perturbations to the input and intervening on the casual factors. Then, we augment recommendation systems by detecting potential attacks with a deep learning-based classifier based on the crafted data. Finally, we study the attack strength and frequency of adversarial examples and evaluate our model on standard datasets with multiple crafting methods. Our extensive experiments show that most adversarial attacks are effective, and both attack strength and attack frequency impact the attack performance. The strategically-timed attack achieves comparative attack performance with only 1/3 to 1/2 attack frequency. Besides, our black-box detector trained with one crafting method has the generalization ability over several other crafting methods.

* arXiv admin note: text overlap with arXiv:2006.07934

Via

Access Paper or Ask Questions

Locality-Sensitive Experience Replay for Online Recommendation

Oct 21, 2021

Xiaocong Chen, Lina Yao, Xianzhi Wang, Julian McAuley

Figure 1 for Locality-Sensitive Experience Replay for Online Recommendation

Figure 2 for Locality-Sensitive Experience Replay for Online Recommendation

Figure 3 for Locality-Sensitive Experience Replay for Online Recommendation

Figure 4 for Locality-Sensitive Experience Replay for Online Recommendation

Abstract:Online recommendation requires handling rapidly changing user preferences. Deep reinforcement learning (DRL) is gaining interest as an effective means of capturing users' dynamic interest during interactions with recommender systems. However, it is challenging to train a DRL agent, due to large state space (e.g., user-item rating matrix and user profiles), action space (e.g., candidate items), and sparse rewards. Existing studies encourage the agent to learn from past experience via experience replay (ER). They adapt poorly to the complex environment of online recommender systems and are inefficient in determining an optimal strategy from past experience. To address these issues, we design a novel state-aware experience replay model, which uses locality-sensitive hashing to map high dimensional data into low-dimensional representations and a prioritized reward-driven strategy to replay more valuable experience at a higher chance. Our model can selectively pick the most relevant and salient experiences and recommend the agent with the optimal policy. Experiments on three online simulation platforms demonstrate our model' feasibility and superiority toseveral existing experience replay methods.

Via

Access Paper or Ask Questions

A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions

Sep 09, 2021

Xiaocong Chen, Lina Yao, Julian McAuley, Guanglin Zhou, Xianzhi Wang

Figure 1 for A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions

Figure 2 for A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions

Figure 3 for A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions

Figure 4 for A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions

Abstract:In light of the emergence of deep reinforcement learning (DRL) in recommender systems research and several fruitful results in recent years, this survey aims to provide a timely and comprehensive overview of the recent trends of deep reinforcement learning in recommender systems. We start with the motivation of applying DRL in recommender systems. Then, we provide a taxonomy of current DRL-based recommender systems and a summary of existing methods. We discuss emerging topics and open issues, and provide our perspective on advancing the domain. This survey serves as introductory material for readers from academia and industry into the topic and identifies notable opportunities for further research.

Via

Access Paper or Ask Questions

Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference

May 05, 2021

Xiaocong Chen, Lina Yao, Xianzhi Wang, Aixin Sun, Wenjie Zhang, Quan Z. Sheng

Figure 1 for Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference

Figure 2 for Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference

Figure 3 for Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference

Figure 4 for Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference

Abstract:Recent advances in reinforcement learning have inspired increasing interest in learning user modeling adaptively through dynamic interactions, e.g., in reinforcement learning based recommender systems. Reward function is crucial for most of reinforcement learning applications as it can provide the guideline about the optimization. However, current reinforcement-learning-based methods rely on manually-defined reward functions, which cannot adapt to dynamic and noisy environments. Besides, they generally use task-specific reward functions that sacrifice generalization ability. We propose a generative inverse reinforcement learning for user behavioral preference modelling, to address the above issues. Instead of using predefined reward functions, our model can automatically learn the rewards from user's actions based on discriminative actor-critic network and Wasserstein GAN. Our model provides a general way of characterizing and explaining underlying behavioral tendencies, and our experiments show our method outperforms state-of-the-art methods in a variety of scenarios, namely traffic signal control, online recommender systems, and scanpath prediction.

Via

Access Paper or Ask Questions

Generative Adversarial U-Net for Domain-free Medical Image Augmentation

Jan 12, 2021

Xiaocong Chen, Yun Li, Lina Yao, Ehsan Adeli, Yu Zhang

Figure 1 for Generative Adversarial U-Net for Domain-free Medical Image Augmentation

Figure 2 for Generative Adversarial U-Net for Domain-free Medical Image Augmentation

Figure 3 for Generative Adversarial U-Net for Domain-free Medical Image Augmentation

Figure 4 for Generative Adversarial U-Net for Domain-free Medical Image Augmentation

Abstract:The shortage of annotated medical images is one of the biggest challenges in the field of medical image computing. Without a sufficient number of training samples, deep learning based models are very likely to suffer from over-fitting problem. The common solution is image manipulation such as image rotation, cropping, or resizing. Those methods can help relieve the over-fitting problem as more training samples are introduced. However, they do not really introduce new images with additional information and may lead to data leakage as the test set may contain similar samples which appear in the training set. To address this challenge, we propose to generate diverse images with generative adversarial network. In this paper, we develop a novel generative method named generative adversarial U-Net , which utilizes both generative adversarial network and U-Net. Different from existing approaches, our newly designed model is domain-free and generalizable to various medical images. Extensive experiments are conducted over eight diverse datasets including computed tomography (CT) scan, pathology, X-ray, etc. The visualization and quantitative results demonstrate the efficacy and good generalization of the proposed method on generating a wide array of high-quality medical images.

Via

Access Paper or Ask Questions

Generative Inverse Deep Reinforcement Learning for Online Recommendation

Nov 04, 2020

Xiaocong Chen, Lina Yao, Aixin Sun, Xianzhi Wang, Xiwei Xu, Liming Zhu

Figure 1 for Generative Inverse Deep Reinforcement Learning for Online Recommendation

Figure 2 for Generative Inverse Deep Reinforcement Learning for Online Recommendation

Figure 3 for Generative Inverse Deep Reinforcement Learning for Online Recommendation

Figure 4 for Generative Inverse Deep Reinforcement Learning for Online Recommendation

Abstract:Deep reinforcement learning enables an agent to capture user's interest through interactions with the environment dynamically. It has attracted great interest in the recommendation research. Deep reinforcement learning uses a reward function to learn user's interest and to control the learning process. However, most reward functions are manually designed; they are either unrealistic or imprecise to reflect the high variety, dimensionality, and non-linearity properties of the recommendation problem. That makes it difficult for the agent to learn an optimal policy to generate the most satisfactory recommendations. To address the above issue, we propose a novel generative inverse reinforcement learning approach, namely InvRec, which extracts the reward function from user's behaviors automatically, for online recommendation. We conduct experiments on an online platform, VirtualTB, and compare with several state-of-the-art methods to demonstrate the feasibility and effectiveness of our proposed approach.

Via

Access Paper or Ask Questions

Momentum Contrastive Learning for Few-Shot COVID-19 Diagnosis from Chest CT Images

Jun 16, 2020

Xiaocong Chen, Lina Yao, Tao Zhou, Jinming Dong, Yu Zhang

Figure 1 for Momentum Contrastive Learning for Few-Shot COVID-19 Diagnosis from Chest CT Images

Figure 2 for Momentum Contrastive Learning for Few-Shot COVID-19 Diagnosis from Chest CT Images

Figure 3 for Momentum Contrastive Learning for Few-Shot COVID-19 Diagnosis from Chest CT Images

Figure 4 for Momentum Contrastive Learning for Few-Shot COVID-19 Diagnosis from Chest CT Images

Abstract:The current pandemic, caused by the outbreak of a novel coronavirus (COVID-19) in December 2019, has led to a global emergency that has significantly impacted economies, healthcare systems and personal wellbeing all around the world. Controlling the rapidly evolving disease requires highly sensitive and specific diagnostics. While real-time RT-PCR is the most commonly used, these can take up to 8 hours, and require significant effort from healthcare professionals. As such, there is a critical need for a quick and automatic diagnostic system. Diagnosis from chest CT images is a promising direction. However, current studies are limited by the lack of sufficient training samples, as acquiring annotated CT images is time-consuming. To this end, we propose a new deep learning algorithm for the automated diagnosis of COVID-19, which only requires a few samples for training. Specifically, we use contrastive learning to train an encoder which can capture expressive feature representations on large and publicly available lung datasets and adopt the prototypical network for classification. We validate the efficacy of the proposed model in comparison with other competing methods on two publicly available and annotated COVID-19 CT datasets. Our results demonstrate the superior performance of our model for the accurate diagnosis of COVID-19 based on chest CT images.

Via

Access Paper or Ask Questions

Adversarial Attacks and Detection on Reinforcement Learning-Based Interactive Recommender Systems

Jun 14, 2020

Yuanjiang Cao, Xiaocong Chen, Lina Yao, Xianzhi Wang, Wei Emma Zhang

Figure 1 for Adversarial Attacks and Detection on Reinforcement Learning-Based Interactive Recommender Systems

Figure 2 for Adversarial Attacks and Detection on Reinforcement Learning-Based Interactive Recommender Systems

Figure 3 for Adversarial Attacks and Detection on Reinforcement Learning-Based Interactive Recommender Systems

Figure 4 for Adversarial Attacks and Detection on Reinforcement Learning-Based Interactive Recommender Systems

Abstract:Adversarial attacks pose significant challenges for detecting adversarial attacks at an early stage. We propose attack-agnostic detection on reinforcement learning-based interactive recommendation systems. We first craft adversarial examples to show their diverse distributions and then augment recommendation systems by detecting potential attacks with a deep learning-based classifier based on the crafted data. Finally, we study the attack strength and frequency of adversarial examples and evaluate our model on standard datasets with multiple crafting methods. Our extensive experiments show that most adversarial attacks are effective, and both attack strength and attack frequency impact the attack performance. The strategically-timed attack achieves comparative attack performance with only 1/3 to 1/2 attack frequency. Besides, our black-box detector trained with one crafting method has the generalization ability over several crafting methods.

Via

Access Paper or Ask Questions