Alert button
Picture for Wanqi Xue

Wanqi Xue

Alert button

Two-Stage Constrained Actor-Critic for Short Video Recommendation

Add code
Bookmark button
Alert button
Feb 06, 2023
Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai

Figure 1 for Two-Stage Constrained Actor-Critic for Short Video Recommendation
Figure 2 for Two-Stage Constrained Actor-Critic for Short Video Recommendation
Figure 3 for Two-Stage Constrained Actor-Critic for Short Video Recommendation
Figure 4 for Two-Stage Constrained Actor-Critic for Short Video Recommendation
Viaarxiv icon

Reinforcement Learning from Diverse Human Preferences

Add code
Bookmark button
Alert button
Jan 30, 2023
Wanqi Xue, Bo An, Shuicheng Yan, Zhongwen Xu

Figure 1 for Reinforcement Learning from Diverse Human Preferences
Figure 2 for Reinforcement Learning from Diverse Human Preferences
Figure 3 for Reinforcement Learning from Diverse Human Preferences
Figure 4 for Reinforcement Learning from Diverse Human Preferences
Viaarxiv icon

PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement

Add code
Bookmark button
Alert button
Dec 06, 2022
Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Bo An

Figure 1 for PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement
Figure 2 for PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement
Figure 3 for PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement
Figure 4 for PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement
Viaarxiv icon

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor

Add code
Bookmark button
Alert button
Jun 01, 2022
Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Bo An

Figure 1 for ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Figure 2 for ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Figure 3 for ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Figure 4 for ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Viaarxiv icon

NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search

Add code
Bookmark button
Alert button
Jan 17, 2022
Wanqi Xue, Bo An, Chai Kiat Yeo

Figure 1 for NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search
Figure 2 for NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search
Figure 3 for NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search
Figure 4 for NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search
Viaarxiv icon

Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 09, 2021
Wanqi Xue, Wei Qiu, Bo An, Zinovi Rabinovich, Svetlana Obraztsova, Chai Kiat Yeo

Figure 1 for Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning
Figure 2 for Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning
Figure 3 for Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning
Figure 4 for Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning
Viaarxiv icon

Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play

Add code
Bookmark button
Alert button
Jun 02, 2021
Wanqi Xue, Youzhi Zhang, Shuxin Li, Xinrun Wang, Bo An, Chai Kiat Yeo

Figure 1 for Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play
Figure 2 for Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play
Figure 3 for Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play
Figure 4 for Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play
Viaarxiv icon

CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space

Add code
Bookmark button
Alert button
May 18, 2021
Shuxin Li, Youzhi Zhang, Xinrun Wang, Wanqi Xue, Bo An

Figure 1 for CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space
Figure 2 for CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space
Figure 3 for CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space
Figure 4 for CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space
Viaarxiv icon

One-Shot Image Classification by Learning to Restore Prototypes

Add code
Bookmark button
Alert button
May 04, 2020
Wanqi Xue, Wei Wang

Figure 1 for One-Shot Image Classification by Learning to Restore Prototypes
Figure 2 for One-Shot Image Classification by Learning to Restore Prototypes
Figure 3 for One-Shot Image Classification by Learning to Restore Prototypes
Figure 4 for One-Shot Image Classification by Learning to Restore Prototypes
Viaarxiv icon