Alert button
Picture for Mohammad Ghavamzadeh

Mohammad Ghavamzadeh

Alert button

Mirror Descent Policy Optimization

Add code
Bookmark button
Alert button
May 20, 2020
Manan Tomar, Lior Shani, Yonathan Efroni, Mohammad Ghavamzadeh

Figure 1 for Mirror Descent Policy Optimization
Figure 2 for Mirror Descent Policy Optimization
Figure 3 for Mirror Descent Policy Optimization
Figure 4 for Mirror Descent Policy Optimization
Viaarxiv icon

Active Model Estimation in Markov Decision Processes

Add code
Bookmark button
Alert button
Mar 06, 2020
Jean Tarbouriech, Shubhanshu Shekhar, Matteo Pirotta, Mohammad Ghavamzadeh, Alessandro Lazaric

Figure 1 for Active Model Estimation in Markov Decision Processes
Figure 2 for Active Model Estimation in Markov Decision Processes
Figure 3 for Active Model Estimation in Markov Decision Processes
Figure 4 for Active Model Estimation in Markov Decision Processes
Viaarxiv icon

Predictive Coding for Locally-Linear Control

Add code
Bookmark button
Alert button
Mar 02, 2020
Rui Shu, Tung Nguyen, Yinlam Chow, Tuan Pham, Khoat Than, Mohammad Ghavamzadeh, Stefano Ermon, Hung H. Bui

Figure 1 for Predictive Coding for Locally-Linear Control
Figure 2 for Predictive Coding for Locally-Linear Control
Figure 3 for Predictive Coding for Locally-Linear Control
Figure 4 for Predictive Coding for Locally-Linear Control
Viaarxiv icon

Policy-Aware Model Learning for Policy Gradient Methods

Add code
Bookmark button
Alert button
Feb 28, 2020
Romina Abachi, Mohammad Ghavamzadeh, Amir-massoud Farahmand

Figure 1 for Policy-Aware Model Learning for Policy Gradient Methods
Figure 2 for Policy-Aware Model Learning for Policy Gradient Methods
Figure 3 for Policy-Aware Model Learning for Policy Gradient Methods
Figure 4 for Policy-Aware Model Learning for Policy Gradient Methods
Viaarxiv icon

Improved Algorithms for Conservative Exploration in Bandits

Add code
Bookmark button
Alert button
Feb 08, 2020
Evrard Garcelon, Mohammad Ghavamzadeh, Alessandro Lazaric, Matteo Pirotta

Figure 1 for Improved Algorithms for Conservative Exploration in Bandits
Figure 2 for Improved Algorithms for Conservative Exploration in Bandits
Figure 3 for Improved Algorithms for Conservative Exploration in Bandits
Figure 4 for Improved Algorithms for Conservative Exploration in Bandits
Viaarxiv icon

Conservative Exploration in Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 08, 2020
Evrard Garcelon, Mohammad Ghavamzadeh, Alessandro Lazaric, Matteo Pirotta

Figure 1 for Conservative Exploration in Reinforcement Learning
Figure 2 for Conservative Exploration in Reinforcement Learning
Figure 3 for Conservative Exploration in Reinforcement Learning
Figure 4 for Conservative Exploration in Reinforcement Learning
Viaarxiv icon

Adaptive Sampling for Estimating Multiple Probability Distributions

Add code
Bookmark button
Alert button
Dec 07, 2019
Shubhanshu Shekhar, Tara Javidi, Mohammad Ghavamzadeh

Figure 1 for Adaptive Sampling for Estimating Multiple Probability Distributions
Figure 2 for Adaptive Sampling for Estimating Multiple Probability Distributions
Figure 3 for Adaptive Sampling for Estimating Multiple Probability Distributions
Figure 4 for Adaptive Sampling for Estimating Multiple Probability Distributions
Viaarxiv icon

Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 14, 2019
Manan Tomar, Yonathan Efroni, Mohammad Ghavamzadeh

Figure 1 for Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning
Figure 2 for Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning
Figure 3 for Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning
Figure 4 for Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning
Viaarxiv icon