Zhenghai Xue

Group-in-Group Policy Optimization for LLM Agent Training

May 16, 2025

Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning

Mar 10, 2025

Modeling User Retention through Generative Flow Networks

Jun 10, 2024

S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic

May 02, 2024

AgentStudio: A Toolkit for Building General Virtual Agents

Mar 26, 2024

AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement

Oct 06, 2023

A Large Language Model Enhanced Conversational Recommender System

Aug 11, 2023

State Regularized Policy Optimization on Data with Dynamics Shift

Jun 06, 2023

Guarded Policy Optimization with Imperfect Online Demonstrations

Mar 03, 2023

Two-Stage Constrained Actor-Critic for Short Video Recommendation

Feb 06, 2023