Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Craig Boutilier

Thompson Sampling with a Mixture Prior


Jun 10, 2021
Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh, Craig Boutilier

* 22 pages, 3 figures 

  Access Paper or Ask Questions

Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities


May 06, 2021
Ruohan Zhan, Konstantina Christakopoulou, Ya Le, Jayden Ooi, Martin Mladenov, Alex Beutel, Craig Boutilier, Ed H. Chi, Minmin Chen


  Access Paper or Ask Questions

RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems


Mar 14, 2021
Martin Mladenov, Chih-Wei Hsu, Vihan Jain, Eugene Ie, Christopher Colby, Nicolas Mayoraz, Hubert Pham, Dustin Tran, Ivan Vendrov, Craig Boutilier


  Access Paper or Ask Questions

Meta-Thompson Sampling


Feb 11, 2021
Branislav Kveton, Mikhail Konobeev, Manzil Zaheer, Chih-wei Hsu, Martin Mladenov, Craig Boutilier, Csaba Szepesvari


  Access Paper or Ask Questions

Non-Stationary Latent Bandits


Dec 01, 2020
Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Mohammad Ghavamzadeh, Craig Boutilier

* 15 pages, 4 figures 

  Access Paper or Ask Questions

Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach


Aug 18, 2020
Martin Mladenov, Elliot Creager, Omer Ben-Porat, Kevin Swersky, Richard Zemel, Craig Boutilier


  Access Paper or Ask Questions

Latent Bandits Revisited


Jun 15, 2020
Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Craig Boutilier

* 16 pages, 2 figures 

  Access Paper or Ask Questions

Differentiable Meta-Learning in Contextual Bandits


Jun 09, 2020
Branislav Kveton, Martin Mladenov, Chih-Wei Hsu, Manzil Zaheer, Csaba Szepesvari, Craig Boutilier


  Access Paper or Ask Questions

ConQUR: Mitigating Delusional Bias in Deep Q-learning


Feb 27, 2020
Andy Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier


  Access Paper or Ask Questions

Differentiable Bandit Exploration


Feb 17, 2020
Craig Boutilier, Chih-Wei Hsu, Branislav Kveton, Martin Mladenov, Csaba Szepesvari, Manzil Zaheer


  Access Paper or Ask Questions

Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing


Feb 12, 2020
Ge Liu, Rui Wu, Heng-Tze Cheng, Jing Wang, Jayden Ooi, Lihong Li, Ang Li, Wai Lok Sibon Li, Craig Boutilier, Ed Chi

* on Deep Reinforcement Learning workshop at NeurIPS 2019 

  Access Paper or Ask Questions

BRPO: Batch Residual Policy Optimization


Feb 08, 2020
Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier


  Access Paper or Ask Questions

Gradient-based Optimization for Bayesian Preference Elicitation


Nov 20, 2019
Ivan Vendrov, Tyler Lu, Qingqing Huang, Craig Boutilier

* To appear in the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) 

  Access Paper or Ask Questions

CAQL: Continuous Action Q-Learning


Oct 09, 2019
Moonkyung Ryu, Yinlam Chow, Ross Anderson, Christian Tjandraatmadja, Craig Boutilier


  Access Paper or Ask Questions

RecSim: A Configurable Simulation Platform for Recommender Systems


Sep 26, 2019
Eugene Ie, Chih-wei Hsu, Martin Mladenov, Vihan Jain, Sanmit Narvekar, Jing Wang, Rui Wu, Craig Boutilier


  Access Paper or Ask Questions

Randomized Exploration in Generalized Linear Bandits


Jun 21, 2019
Branislav Kveton, Manzil Zaheer, Csaba Szepesvari, Lihong Li, Mohammad Ghavamzadeh, Craig Boutilier


  Access Paper or Ask Questions

Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology


May 31, 2019
Eugene Ie, Vihan Jain, Jing Wang, Sanmit Narvekar, Ritesh Agarwal, Rui Wu, Heng-Tze Cheng, Morgane Lustman, Vince Gatto, Paul Covington, Jim McFadden, Tushar Chandra, Craig Boutilier

* Short version to appear IJCAI-2019 

  Access Paper or Ask Questions

Advantage Amplification in Slowly Evolving Latent-State Environments


May 29, 2019
Martin Mladenov, Ofer Meshi, Jayden Ooi, Dale Schuurmans, Craig Boutilier


  Access Paper or Ask Questions

Perturbed-History Exploration in Stochastic Linear Bandits


Mar 21, 2019
Branislav Kveton, Csaba Szepesvari, Mohammad Ghavamzadeh, Craig Boutilier


  Access Paper or Ask Questions

Perturbed-History Exploration in Stochastic Multi-Armed Bandits


Feb 26, 2019
Branislav Kveton, Csaba Szepesvari, Mohammad Ghavamzadeh, Craig Boutilier


  Access Paper or Ask Questions

Seq2Slate: Re-ranking and Slate Optimization with RNNs


Oct 04, 2018
Irwan Bello, Sayali Kulkarni, Sagar Jain, Craig Boutilier, Ed Chi, Elad Eban, Xiyang Luo, Alan Mackey, Ofer Meshi


  Access Paper or Ask Questions

Planning and Learning with Stochastic Action Sets


May 07, 2018
Craig Boutilier, Alon Cohen, Amit Daniely, Avinatan Hassidim, Yishay Mansour, Ofer Meshi, Martin Mladenov, Dale Schuurmans


  Access Paper or Ask Questions

Safe Exploration for Identifying Linear Systems via Robust Optimization


Nov 30, 2017
Tyler Lu, Martin Zinkevich, Craig Boutilier, Binz Roy, Dale Schuurmans


  Access Paper or Ask Questions

Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (2000)


Aug 28, 2014
Craig Boutilier, Moises Goldszmidt


  Access Paper or Ask Questions

Structured Reachability Analysis for Markov Decision Processes


Apr 23, 2013
Craig Boutilier, Ronen I. Brafman, Christopher W. Geib

* Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998) 

  Access Paper or Ask Questions

Modal Logics for Qualitative Possibility and Beliefs


Mar 13, 2013
Craig Boutilier

* Appears in Proceedings of the Eighth Conference on Uncertainty in Artificial Intelligence (UAI1992) 

  Access Paper or Ask Questions

The Probability of a Possibility: Adding Uncertainty to Default Rules


Mar 06, 2013
Craig Boutilier

* Appears in Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence (UAI1993) 

  Access Paper or Ask Questions