Mohammad Ghavamzadeh

Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models

Apr 02, 2024
Kyuyoung Kim, Jongheon Jeong, Minyong An, Mohammad Ghavamzadeh, Krishnamurthy Dvijotham, Jinwoo Shin, Kimin Lee

Contextual Bandits with Stage-wise Constraints

Jan 15, 2024
Aldo Pacchiano, Mohammad Ghavamzadeh, Peter Bartlett

Maximum Entropy Model Correction in Reinforcement Learning

Nov 29, 2023
Amin Rakhsha, Mete Kemertas, Mohammad Ghavamzadeh, Amir-massoud Farahmand

Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage

Oct 27, 2023
Kishan Panaganti, Zaiyan Xu, Dileep Kalathil, Mohammad Ghavamzadeh

Preference Elicitation with Soft Attributes in Interactive Recommendation

Oct 22, 2023
Erdem Biyik, Fan Yao, Yinlam Chow, Alex Haig, Chih-wei Hsu, Mohammad Ghavamzadeh, Craig Boutilier

Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Oct 09, 2023
Jihwan Jeong, Yinlam Chow, Guy Tennenholtz, Chih-Wei Hsu, Azamat Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier

A Convex Relaxation Approach to Bayesian Regret Minimization in Offline Bandits

Jun 02, 2023
Mohammad Ghavamzadeh, Marek Petrik, Guy Tennenholtz

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

May 25, 2023
Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee

Private and Communication-Efficient Algorithms for Entropy Estimation

May 12, 2023
Gecia Bravo-Hermsdorff, Róbert Busa-Fekete, Mohammad Ghavamzadeh, Andres Muñoz Medina, Umar Syed

On Dynamic Program Decompositions of Static Risk Measures

Apr 24, 2023
Jia Lin Hau, Erick Delage, Mohammad Ghavamzadeh, Marek Petrik
