Alert button
Picture for Mohammad Ghavamzadeh

Mohammad Ghavamzadeh

Alert button

Collaborative Multi-agent Stochastic Linear Bandits

Add code
Bookmark button
Alert button
May 12, 2022
Ahmadreza Moradipari, Mohammad Ghavamzadeh, Mahnoosh Alizadeh

Figure 1 for Collaborative Multi-agent Stochastic Linear Bandits
Figure 2 for Collaborative Multi-agent Stochastic Linear Bandits
Viaarxiv icon

Multi-Environment Meta-Learning in Stochastic Linear Bandits

Add code
Bookmark button
Alert button
May 12, 2022
Ahmadreza Moradipari, Mohammad Ghavamzadeh, Taha Rajabzadeh, Christos Thrampoulidis, Mahnoosh Alizadeh

Figure 1 for Multi-Environment Meta-Learning in Stochastic Linear Bandits
Viaarxiv icon

Efficient Risk-Averse Reinforcement Learning

Add code
Bookmark button
Alert button
May 10, 2022
Ido Greenberg, Yinlam Chow, Mohammad Ghavamzadeh, Shie Mannor

Figure 1 for Efficient Risk-Averse Reinforcement Learning
Figure 2 for Efficient Risk-Averse Reinforcement Learning
Figure 3 for Efficient Risk-Averse Reinforcement Learning
Figure 4 for Efficient Risk-Averse Reinforcement Learning
Viaarxiv icon

Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

Add code
Bookmark button
Alert button
Mar 13, 2022
MohammadJavad Azizi, Thang Duong, Yasin Abbasi-Yadkori, András György, Claire Vernade, Mohammad Ghavamzadeh

Figure 1 for Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Figure 2 for Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Figure 3 for Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Figure 4 for Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Viaarxiv icon

Meta-Learning for Simple Regret Minimization

Add code
Bookmark button
Alert button
Feb 25, 2022
Mohammadjavad Azizi, Branislav Kveton, Mohammad Ghavamzadeh, Sumeet Katariya

Figure 1 for Meta-Learning for Simple Regret Minimization
Figure 2 for Meta-Learning for Simple Regret Minimization
Figure 3 for Meta-Learning for Simple Regret Minimization
Figure 4 for Meta-Learning for Simple Regret Minimization
Viaarxiv icon

Deep Hierarchy in Bandits

Add code
Bookmark button
Alert button
Feb 03, 2022
Joey Hong, Branislav Kveton, Sumeet Katariya, Manzil Zaheer, Mohammad Ghavamzadeh

Figure 1 for Deep Hierarchy in Bandits
Figure 2 for Deep Hierarchy in Bandits
Figure 3 for Deep Hierarchy in Bandits
Figure 4 for Deep Hierarchy in Bandits
Viaarxiv icon

Hierarchical Bayesian Bandits

Add code
Bookmark button
Alert button
Nov 12, 2021
Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh

Figure 1 for Hierarchical Bayesian Bandits
Figure 2 for Hierarchical Bayesian Bandits
Figure 3 for Hierarchical Bayesian Bandits
Viaarxiv icon

Fixed-Budget Best-Arm Identification in Contextual Bandits: A Static-Adaptive Algorithm

Add code
Bookmark button
Alert button
Jun 22, 2021
MohammadJavad Azizi, Branislav Kveton, Mohammad Ghavamzadeh

Figure 1 for Fixed-Budget Best-Arm Identification in Contextual Bandits: A Static-Adaptive Algorithm
Figure 2 for Fixed-Budget Best-Arm Identification in Contextual Bandits: A Static-Adaptive Algorithm
Figure 3 for Fixed-Budget Best-Arm Identification in Contextual Bandits: A Static-Adaptive Algorithm
Figure 4 for Fixed-Budget Best-Arm Identification in Contextual Bandits: A Static-Adaptive Algorithm
Viaarxiv icon

Thompson Sampling with a Mixture Prior

Add code
Bookmark button
Alert button
Jun 10, 2021
Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh, Craig Boutilier

Figure 1 for Thompson Sampling with a Mixture Prior
Figure 2 for Thompson Sampling with a Mixture Prior
Viaarxiv icon