Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning

Sep 10, 2018

Weixun Wang, Junqi Jin, Jianye Hao, Chunjie Chen, Chuan Yu, Weinan Zhang, Jun Wang, Yixi Wang, Han Li, Jian Xu(+1 more)

Figure 1 for Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning

Figure 2 for Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning

Figure 3 for Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning

Figure 4 for Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:For online advertising in e-commerce, the traditional problem is to assign the right ad to the right user on fixed ad slots. In this paper, we investigate the problem of advertising with adaptive exposure, in which the number of ad slots and their locations can dynamically change over time based on their relative scores with recommendation products. In order to maintain user retention and long-term revenue, there are two types of constraints that need to be met in exposure: query-level and day-level constraints. We model this problem as constrained markov decision process with per-state constraint (psCMDP) and propose a constrained two-level reinforcement learning to decouple the original advertising exposure optimization problem into two relatively independent sub-optimization problems. We also propose a constrained hindsight experience replay mechanism to accelerate the policy training process. Experimental results show that our method can improve the advertising revenue while satisfying different levels of constraints under the real-world datasets. Besides, the proposal of constrained hindsight experience replay mechanism can significantly improve the training speed and the stability of policy performance.

* 10 pages, 8 figures

View paper on

Share this with someone who'll enjoy it:

Title:Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning

Paper and Code