Picture for Flavian Vasile

Flavian Vasile

From Clicks to Conversions: Recommendation for long-term reward

Add code
Sep 01, 2020
Viaarxiv icon

BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals

Add code
Aug 28, 2020
Figure 1 for BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals
Figure 2 for BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals
Figure 3 for BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals
Figure 4 for BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals
Viaarxiv icon

Reconsidering Analytical Variational Bounds for Output Layers of Deep Networks

Add code
Oct 03, 2019
Figure 1 for Reconsidering Analytical Variational Bounds for Output Layers of Deep Networks
Figure 2 for Reconsidering Analytical Variational Bounds for Output Layers of Deep Networks
Figure 3 for Reconsidering Analytical Variational Bounds for Output Layers of Deep Networks
Viaarxiv icon

Learning from Bandit Feedback: An Overview of the State-of-the-art

Add code
Sep 18, 2019
Figure 1 for Learning from Bandit Feedback: An Overview of the State-of-the-art
Figure 2 for Learning from Bandit Feedback: An Overview of the State-of-the-art
Viaarxiv icon

Relaxed Softmax for learning from Positive and Unlabeled data

Add code
Sep 17, 2019
Figure 1 for Relaxed Softmax for learning from Positive and Unlabeled data
Figure 2 for Relaxed Softmax for learning from Positive and Unlabeled data
Figure 3 for Relaxed Softmax for learning from Positive and Unlabeled data
Figure 4 for Relaxed Softmax for learning from Positive and Unlabeled data
Viaarxiv icon

Recommendation System-based Upper Confidence Bound for Online Advertising

Add code
Sep 09, 2019
Figure 1 for Recommendation System-based Upper Confidence Bound for Online Advertising
Figure 2 for Recommendation System-based Upper Confidence Bound for Online Advertising
Figure 3 for Recommendation System-based Upper Confidence Bound for Online Advertising
Figure 4 for Recommendation System-based Upper Confidence Bound for Online Advertising
Viaarxiv icon

On the Value of Bandit Feedback for Offline Recommender System Evaluation

Add code
Jul 26, 2019
Figure 1 for On the Value of Bandit Feedback for Offline Recommender System Evaluation
Figure 2 for On the Value of Bandit Feedback for Offline Recommender System Evaluation
Viaarxiv icon

Distributionally Robust Counterfactual Risk Minimization

Add code
Jun 14, 2019
Figure 1 for Distributionally Robust Counterfactual Risk Minimization
Figure 2 for Distributionally Robust Counterfactual Risk Minimization
Figure 3 for Distributionally Robust Counterfactual Risk Minimization
Viaarxiv icon

Three Methods for Training on Bandit Feedback

Add code
Apr 24, 2019
Figure 1 for Three Methods for Training on Bandit Feedback
Figure 2 for Three Methods for Training on Bandit Feedback
Viaarxiv icon

RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising

Add code
Sep 14, 2018
Figure 1 for RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
Figure 2 for RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
Figure 3 for RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
Figure 4 for RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
Viaarxiv icon