Picture for Craig Boutilier

Craig Boutilier

University of Toronto

Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models

Add code
Sep 22, 2023
Figure 1 for Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models
Viaarxiv icon

Content Prompting: Modeling Content Provider Dynamics to Improve User Welfare in Recommender Ecosystems

Add code
Sep 02, 2023
Viaarxiv icon

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Add code
May 25, 2023
Figure 1 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 2 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 3 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 4 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Viaarxiv icon

Aligning Text-to-Image Models using Human Feedback

Add code
Feb 23, 2023
Figure 1 for Aligning Text-to-Image Models using Human Feedback
Figure 2 for Aligning Text-to-Image Models using Human Feedback
Figure 3 for Aligning Text-to-Image Models using Human Feedback
Figure 4 for Aligning Text-to-Image Models using Human Feedback
Viaarxiv icon

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

Add code
Feb 21, 2023
Viaarxiv icon

Reinforcement Learning with History-Dependent Dynamic Contexts

Add code
Feb 04, 2023
Figure 1 for Reinforcement Learning with History-Dependent Dynamic Contexts
Figure 2 for Reinforcement Learning with History-Dependent Dynamic Contexts
Viaarxiv icon

Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report

Add code
Oct 27, 2022
Viaarxiv icon

Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning

Add code
Jul 25, 2022
Figure 1 for Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Figure 2 for Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Figure 3 for Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Figure 4 for Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Viaarxiv icon

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis

Add code
Jul 20, 2022
Figure 1 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 2 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 3 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 4 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Viaarxiv icon

A Mixture-of-Expert Approach to RL-based Dialogue Management

Add code
May 31, 2022
Figure 1 for A Mixture-of-Expert Approach to RL-based Dialogue Management
Figure 2 for A Mixture-of-Expert Approach to RL-based Dialogue Management
Figure 3 for A Mixture-of-Expert Approach to RL-based Dialogue Management
Figure 4 for A Mixture-of-Expert Approach to RL-based Dialogue Management
Viaarxiv icon