Picture for Xian Guo

Xian Guo

SMILE: SeMantic Ids Enhanced CoLd Item Representation for Click-through Rate Prediction in E-commerce SEarch

Add code
Oct 14, 2025
Viaarxiv icon

OneSug: The Unified End-to-End Generative Framework for E-commerce Query Suggestion

Add code
Jun 07, 2025
Viaarxiv icon

Retrieval Augmented Learning: A Retrial-based Large Language Model Self-Supervised Learning and Autonomous Knowledge Generation

Add code
May 02, 2025
Viaarxiv icon

Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

Add code
Feb 19, 2025
Figure 1 for Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
Figure 2 for Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
Figure 3 for Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
Figure 4 for Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
Viaarxiv icon

LLM-PySC2: Starcraft II learning environment for Large Language Models

Add code
Nov 08, 2024
Figure 1 for LLM-PySC2: Starcraft II learning environment for Large Language Models
Figure 2 for LLM-PySC2: Starcraft II learning environment for Large Language Models
Figure 3 for LLM-PySC2: Starcraft II learning environment for Large Language Models
Figure 4 for LLM-PySC2: Starcraft II learning environment for Large Language Models
Viaarxiv icon

Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning

Add code
Nov 13, 2020
Figure 1 for Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Figure 2 for Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Figure 3 for Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Figure 4 for Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Viaarxiv icon