Picture for Craig Boutilier

Craig Boutilier

University of Toronto

Embedding-Aligned Language Models

Add code
May 24, 2024
Viaarxiv icon

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning

Add code
Feb 25, 2024
Viaarxiv icon

Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval

Add code
Nov 15, 2023
Figure 1 for Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval
Figure 2 for Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval
Figure 3 for Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval
Figure 4 for Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval
Viaarxiv icon

Preference Elicitation with Soft Attributes in Interactive Recommendation

Add code
Oct 22, 2023
Viaarxiv icon

Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Add code
Oct 09, 2023
Figure 1 for Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Figure 2 for Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Figure 3 for Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Figure 4 for Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Viaarxiv icon

Demystifying Embedding Spaces using Large Language Models

Add code
Oct 06, 2023
Figure 1 for Demystifying Embedding Spaces using Large Language Models
Figure 2 for Demystifying Embedding Spaces using Large Language Models
Figure 3 for Demystifying Embedding Spaces using Large Language Models
Figure 4 for Demystifying Embedding Spaces using Large Language Models
Viaarxiv icon

Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models

Add code
Sep 22, 2023
Figure 1 for Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models
Viaarxiv icon

Content Prompting: Modeling Content Provider Dynamics to Improve User Welfare in Recommender Ecosystems

Add code
Sep 02, 2023
Viaarxiv icon

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Add code
May 25, 2023
Figure 1 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 2 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 3 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Figure 4 for DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Viaarxiv icon

Aligning Text-to-Image Models using Human Feedback

Add code
Feb 23, 2023
Figure 1 for Aligning Text-to-Image Models using Human Feedback
Figure 2 for Aligning Text-to-Image Models using Human Feedback
Figure 3 for Aligning Text-to-Image Models using Human Feedback
Figure 4 for Aligning Text-to-Image Models using Human Feedback
Viaarxiv icon