Alert button

Exploration and Regularization of the Latent Action Space in Recommendation

Feb 08, 2023
Shuchang Liu, Qingpeng Cai, Bowen Sun, Yuhao Wang, Ji Jiang, Dong Zheng, Kun Gai, Peng Jiang, Xiangyu Zhao, Yongfeng Zhang

Figure 1 for Exploration and Regularization of the Latent Action Space in Recommendation
Figure 2 for Exploration and Regularization of the Latent Action Space in Recommendation
Figure 3 for Exploration and Regularization of the Latent Action Space in Recommendation
Figure 4 for Exploration and Regularization of the Latent Action Space in Recommendation

Share this with someone who'll enjoy it:

In recommender systems, reinforcement learning solutions have effectively boosted recommendation performance because of their ability to capture long-term user-system interaction. However, the action space of the recommendation policy is a list of items, which could be extremely large with a dynamic candidate item pool. To overcome this challenge, we propose a hyper-actor and critic learning framework where the policy decomposes the item list generation process into a hyper-action inference step and an effect-action selection step. The first step maps the given state space into a vectorized hyper-action space, and the second step selects the item list based on the hyper-action. In order to regulate the discrepancy between the two action spaces, we design an alignment module along with a kernel mapping function for items to ensure inference accuracy and include a supervision module to stabilize the learning process. We build simulated environments on public datasets and empirically show that our framework is superior in recommendation compared to standard RL baselines.

* Proceedings of the ACM Web Conference 2023 (WWW '23), May 1--5, 2023, Austin, TX, USA  
View paper onarxiv icon

Share this with someone who'll enjoy it: