Picture for Alex Nikulkov

Alex Nikulkov

Pearl: A Production-ready Reinforcement Learning Agent

Add code
Dec 06, 2023
Viaarxiv icon

Offline Reinforcement Learning for Optimizing Production Bidding Policies

Add code
Oct 13, 2023
Figure 1 for Offline Reinforcement Learning for Optimizing Production Bidding Policies
Figure 2 for Offline Reinforcement Learning for Optimizing Production Bidding Policies
Figure 3 for Offline Reinforcement Learning for Optimizing Production Bidding Policies
Figure 4 for Offline Reinforcement Learning for Optimizing Production Bidding Policies
Viaarxiv icon

Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning

Add code
May 24, 2023
Figure 1 for Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning
Figure 2 for Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning
Figure 3 for Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning
Figure 4 for Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning
Viaarxiv icon

Optimism Based Exploration in Large-Scale Recommender Systems

Add code
Apr 05, 2023
Figure 1 for Optimism Based Exploration in Large-Scale Recommender Systems
Figure 2 for Optimism Based Exploration in Large-Scale Recommender Systems
Figure 3 for Optimism Based Exploration in Large-Scale Recommender Systems
Figure 4 for Optimism Based Exploration in Large-Scale Recommender Systems
Viaarxiv icon