Craig Boutilier

Aligning Text-to-Image Models using Human Feedback

Feb 23, 2023
Kimin Lee, Hao Liu, Moonkyung Ryu, Olivia Watkins, Yuqing Du, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Shixiang Shane Gu

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

Feb 21, 2023
Dhawal Gupta, Yinlam Chow, Mohammad Ghavamzadeh, Craig Boutilier

Reinforcement Learning with History-Dependent Dynamic Contexts

Feb 04, 2023
Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier

Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report

Oct 27, 2022
Michael L. Littman, Ifeoma Ajunwa, Guy Berger, Craig Boutilier, Morgan Currie, Finale Doshi-Velez, Gillian Hadfield, Michael C. Horowitz, Charles Isbell, Hiroaki Kitano, Karen Levy, Terah Lyons, Melanie Mitchell, Julie Shah, Steven Sloman, Shannon Vallor, Toby Walsh

Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning

Jul 25, 2022
Deborah Cohen, Moonkyung Ryu, Yinlam Chow, Orgad Keller, Ido Greenberg, Avinatan Hassidim, Michael Fink, Yossi Matias, Idan Szpektor, Craig Boutilier, Gal Elidan

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis

Jul 20, 2022
Jonathan Stray, Alon Halevy, Parisa Assar, Dylan Hadfield-Menell, Craig Boutilier, Amar Ashar, Lex Beattie, Michael Ekstrand, Claire Leibowicz, Connie Moon Sehat, Sara Johansen, Lianne Kerlin, David Vickrey, Spandana Singh, Sanne Vrijenhoek, Amy Zhang, McKane Andrus, Natali Helberger, Polina Proutskova, Tanushree Mitra, Nina Vasan

A Mixture-of-Expert Approach to RL-based Dialogue Management

May 31, 2022
Yinlam Chow, Aza Tulepbergenov, Ofir Nachum, MoonKyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier

Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors

Feb 06, 2022
Christina Göpfert, Yinlam Chow, Chih-wei Hsu, Ivan Vendrov, Tyler Lu, Deepak Ramachandran, Craig Boutilier

IMO$^3$: Interactive Multi-Objective Off-Policy Optimization

Jan 25, 2022
Nan Wang, Hongning Wang, Maryam Karimzadehgan, Branislav Kveton, Craig Boutilier

Thompson Sampling with a Mixture Prior

Jun 10, 2021
Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh, Craig Boutilier
