Mohammad Ghavamzadeh

Benchmarking Batch Deep Reinforcement Learning Algorithms

Oct 03, 2019
Scott Fujimoto, Edoardo Conti, Mohammad Ghavamzadeh, Joelle Pineau

Via arXiv

Multi-Step Greedy and Approximate Real Time Dynamic Programming

Sep 10, 2019
Yonathan Efroni, Mohammad Ghavamzadeh, Shie Mannor

Via arXiv

Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control

Sep 04, 2019
Nir Levine, Yinlam Chow, Rui Shu, Ang Li, Mohammad Ghavamzadeh, Hung Bui

Via arXiv

Randomized Exploration in Generalized Linear Bandits

Jun 21, 2019
Branislav Kveton, Manzil Zaheer, Csaba Szepesvari, Lihong Li, Mohammad Ghavamzadeh, Craig Boutilier

Via arXiv

Active Learning for Binary Classification with Abstention

Jun 01, 2019
Shubhanshu Shekhar, Mohammad Ghavamzadeh, Tara Javidi

Via arXiv

Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies

May 27, 2019
Yonathan Efroni, Nadav Merlis, Mohammad Ghavamzadeh, Shie Mannor

Via arXiv

Binary Classification with Bounded Abstention Rate

May 23, 2019
Shubhanshu Shekhar, Mohammad Ghavamzadeh, Tara Javidi

Via arXiv

Perturbed-History Exploration in Stochastic Linear Bandits

Mar 21, 2019
Branislav Kveton, Csaba Szepesvari, Mohammad Ghavamzadeh, Craig Boutilier

Via arXiv

Perturbed-History Exploration in Stochastic Multi-Armed Bandits

Feb 26, 2019
Branislav Kveton, Csaba Szepesvari, Mohammad Ghavamzadeh, Craig Boutilier

Via arXiv

Lyapunov-based Safe Policy Optimization for Continuous Control

Jan 28, 2019
Yinlam Chow, Ofir Nachum, Aleksandra Faust, Mohammad Ghavamzadeh, Edgar Duenez-Guzman

Via arXiv