Alert button
Picture for Craig Boutilier

Craig Boutilier

Alert button

Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities

May 06, 2021
Ruohan Zhan, Konstantina Christakopoulou, Ya Le, Jayden Ooi, Martin Mladenov, Alex Beutel, Craig Boutilier, Ed H. Chi, Minmin Chen

Figure 1 for Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Figure 2 for Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Figure 3 for Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Figure 4 for Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Viaarxiv icon

RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems

Mar 14, 2021
Martin Mladenov, Chih-Wei Hsu, Vihan Jain, Eugene Ie, Christopher Colby, Nicolas Mayoraz, Hubert Pham, Dustin Tran, Ivan Vendrov, Craig Boutilier

Figure 1 for RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Figure 2 for RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Figure 3 for RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Viaarxiv icon

Meta-Thompson Sampling

Feb 11, 2021
Branislav Kveton, Mikhail Konobeev, Manzil Zaheer, Chih-wei Hsu, Martin Mladenov, Craig Boutilier, Csaba Szepesvari

Figure 1 for Meta-Thompson Sampling
Figure 2 for Meta-Thompson Sampling
Figure 3 for Meta-Thompson Sampling
Viaarxiv icon

Non-Stationary Latent Bandits

Dec 01, 2020
Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Mohammad Ghavamzadeh, Craig Boutilier

Figure 1 for Non-Stationary Latent Bandits
Figure 2 for Non-Stationary Latent Bandits
Figure 3 for Non-Stationary Latent Bandits
Viaarxiv icon

Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach

Aug 18, 2020
Martin Mladenov, Elliot Creager, Omer Ben-Porat, Kevin Swersky, Richard Zemel, Craig Boutilier

Figure 1 for Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach
Figure 2 for Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach
Figure 3 for Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach
Figure 4 for Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach
Viaarxiv icon

Latent Bandits Revisited

Jun 15, 2020
Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Craig Boutilier

Figure 1 for Latent Bandits Revisited
Figure 2 for Latent Bandits Revisited
Viaarxiv icon

Differentiable Meta-Learning in Contextual Bandits

Jun 09, 2020
Branislav Kveton, Martin Mladenov, Chih-Wei Hsu, Manzil Zaheer, Csaba Szepesvari, Craig Boutilier

Figure 1 for Differentiable Meta-Learning in Contextual Bandits
Figure 2 for Differentiable Meta-Learning in Contextual Bandits
Figure 3 for Differentiable Meta-Learning in Contextual Bandits
Figure 4 for Differentiable Meta-Learning in Contextual Bandits
Viaarxiv icon

ConQUR: Mitigating Delusional Bias in Deep Q-learning

Feb 27, 2020
Andy Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier

Figure 1 for ConQUR: Mitigating Delusional Bias in Deep Q-learning
Figure 2 for ConQUR: Mitigating Delusional Bias in Deep Q-learning
Figure 3 for ConQUR: Mitigating Delusional Bias in Deep Q-learning
Figure 4 for ConQUR: Mitigating Delusional Bias in Deep Q-learning
Viaarxiv icon

Differentiable Bandit Exploration

Feb 17, 2020
Craig Boutilier, Chih-Wei Hsu, Branislav Kveton, Martin Mladenov, Csaba Szepesvari, Manzil Zaheer

Figure 1 for Differentiable Bandit Exploration
Figure 2 for Differentiable Bandit Exploration
Figure 3 for Differentiable Bandit Exploration
Viaarxiv icon