Mohammad Ghavamzadeh

INRIA Lille - Nord Europe

Multi-Step Greedy and Approximate Real Time Dynamic Programming

Sep 10, 2019

Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control

Sep 04, 2019

Randomized Exploration in Generalized Linear Bandits

Jun 21, 2019

Active Learning for Binary Classification with Abstention

Jun 01, 2019

Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies

May 27, 2019

Binary Classification with Bounded Abstention Rate

May 23, 2019

Perturbed-History Exploration in Stochastic Linear Bandits

Mar 21, 2019

Perturbed-History Exploration in Stochastic Multi-Armed Bandits

Feb 26, 2019

Lyapunov-based Safe Policy Optimization for Continuous Control

Jan 28, 2019

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits

Nov 13, 2018