Mohammad Ghavamzadeh

Contextual Bandits with Stage-wise Constraints

Jan 15, 2024
Aldo Pacchiano, Mohammad Ghavamzadeh, Peter Bartlett

Maximum Entropy Model Correction in Reinforcement Learning

Nov 29, 2023
Amin Rakhsha, Mete Kemertas, Mohammad Ghavamzadeh, Amir-massoud Farahmand

Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage

Oct 27, 2023
Kishan Panaganti, Zaiyan Xu, Dileep Kalathil, Mohammad Ghavamzadeh

Preference Elicitation with Soft Attributes in Interactive Recommendation

Oct 22, 2023
Erdem Biyik, Fan Yao, Yinlam Chow, Alex Haig, Chih-wei Hsu, Mohammad Ghavamzadeh, Craig Boutilier

Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Oct 09, 2023
Jihwan Jeong, Yinlam Chow, Guy Tennenholtz, Chih-Wei Hsu, Azamat Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier

A Convex Relaxation Approach to Bayesian Regret Minimization in Offline Bandits

Jun 02, 2023
Mohammad Ghavamzadeh, Marek Petrik, Guy Tennenholtz

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

May 25, 2023
Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee

Private and Communication-Efficient Algorithms for Entropy Estimation

May 12, 2023
Gecia Bravo-Hermsdorff, Róbert Busa-Fekete, Mohammad Ghavamzadeh, Andres Muñoz Medina, Umar Syed

On Dynamic Program Decompositions of Static Risk Measures

Apr 24, 2023
Jia Lin Hau, Erick Delage, Mohammad Ghavamzadeh, Marek Petrik

A Review of Deep Learning for Video Captioning

Apr 22, 2023
Moloud Abdar, Meenakshi Kollati, Swaraja Kuraparthi, Farhad Pourpanah, Daniel McDuff, Mohammad Ghavamzadeh, Shuicheng Yan, Abduallah Mohamed, Abbas Khosravi, Erik Cambria, Fatih Porikli
