Picture for David Rohde

David Rohde

Unified PAC-Bayesian Study of Pessimism for Offline Policy Learning with Regularized Importance Sampling

Add code
Jun 05, 2024
Viaarxiv icon

Bayesian Off-Policy Evaluation and Learning for Large Action Spaces

Add code
Feb 22, 2024
Figure 1 for Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Figure 2 for Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Figure 3 for Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Figure 4 for Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Viaarxiv icon

Position Paper: Why the Shooting in the Dark Method Dominates Recommender Systems Practice; A Call to Abandon Anti-Utopian Thinking

Add code
Feb 08, 2024
Viaarxiv icon

Fast Slate Policy Optimization: Going Beyond Plackett-Luce

Add code
Aug 03, 2023
Figure 1 for Fast Slate Policy Optimization: Going Beyond Plackett-Luce
Figure 2 for Fast Slate Policy Optimization: Going Beyond Plackett-Luce
Figure 3 for Fast Slate Policy Optimization: Going Beyond Plackett-Luce
Figure 4 for Fast Slate Policy Optimization: Going Beyond Plackett-Luce
Viaarxiv icon

Exponential Smoothing for Off-Policy Learning

Add code
May 25, 2023
Figure 1 for Exponential Smoothing for Off-Policy Learning
Figure 2 for Exponential Smoothing for Off-Policy Learning
Figure 3 for Exponential Smoothing for Off-Policy Learning
Figure 4 for Exponential Smoothing for Off-Policy Learning
Viaarxiv icon

Learning from aggregated data with a maximum entropy model

Add code
Oct 05, 2022
Figure 1 for Learning from aggregated data with a maximum entropy model
Figure 2 for Learning from aggregated data with a maximum entropy model
Figure 3 for Learning from aggregated data with a maximum entropy model
Figure 4 for Learning from aggregated data with a maximum entropy model
Viaarxiv icon

Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation

Add code
Sep 18, 2022
Viaarxiv icon

Fast Offline Policy Optimization for Large Scale Recommendation

Add code
Aug 11, 2022
Figure 1 for Fast Offline Policy Optimization for Large Scale Recommendation
Figure 2 for Fast Offline Policy Optimization for Large Scale Recommendation
Figure 3 for Fast Offline Policy Optimization for Large Scale Recommendation
Figure 4 for Fast Offline Policy Optimization for Large Scale Recommendation
Viaarxiv icon

A Scalable Probabilistic Model for Reward Optimizing Slate Recommendation

Add code
Aug 10, 2022
Figure 1 for A Scalable Probabilistic Model for Reward Optimizing Slate Recommendation
Figure 2 for A Scalable Probabilistic Model for Reward Optimizing Slate Recommendation
Figure 3 for A Scalable Probabilistic Model for Reward Optimizing Slate Recommendation
Figure 4 for A Scalable Probabilistic Model for Reward Optimizing Slate Recommendation
Viaarxiv icon

Combining Reward and Rank Signals for Slate Recommendation

Add code
Jul 29, 2021
Figure 1 for Combining Reward and Rank Signals for Slate Recommendation
Figure 2 for Combining Reward and Rank Signals for Slate Recommendation
Figure 3 for Combining Reward and Rank Signals for Slate Recommendation
Figure 4 for Combining Reward and Rank Signals for Slate Recommendation
Viaarxiv icon