Zhenghai Xue

AgentStudio: A Toolkit for Building General Virtual Agents

Mar 26, 2024
Longtao Zheng, Zhiyuan Huang, Zhenghai Xue, Xinrun Wang, Bo An, Shuicheng Yan

Via arXiv

AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement

Oct 06, 2023
Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, Bo An

Via arXiv

A Large Language Model Enhanced Conversational Recommender System

Aug 11, 2023
Yue Feng, Shuchang Liu, Zhenghai Xue, Qingpeng Cai, Lantao Hu, Peng Jiang, Kun Gai, Fei Sun

Via arXiv

State Regularized Policy Optimization on Data with Dynamics Shift

Jun 06, 2023
Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An

Via arXiv

Guarded Policy Optimization with Imperfect Online Demonstrations

Mar 03, 2023
Zhenghai Xue, Zhenghao Peng, Quanyi Li, Zhihan Liu, Bolei Zhou

Via arXiv

Two-Stage Constrained Actor-Critic for Short Video Recommendation

Feb 06, 2023
Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai

Via arXiv

PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement

Dec 06, 2022
Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Bo An

Via arXiv

MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning

Sep 26, 2021
Quanyi Li, Zhenghao Peng, Zhenghai Xue, Qihang Zhang, Bolei Zhou

Via arXiv

Regret Minimization Experience Replay

Jun 06, 2021
Zhenghai Xue, Xu-Hui Liu, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu

Via arXiv