Alert button
Picture for Mohammad Sadegh Talebi

Mohammad Sadegh Talebi

Alert button

Improved Exploration in Factored Average-Reward MDPs

Add code
Bookmark button
Alert button
Sep 09, 2020
Mohammad Sadegh Talebi, Anders Jonsson, Odalric-Ambrym Maillard

Figure 1 for Improved Exploration in Factored Average-Reward MDPs
Figure 2 for Improved Exploration in Factored Average-Reward MDPs
Figure 3 for Improved Exploration in Factored Average-Reward MDPs
Viaarxiv icon

Tightening Exploration in Upper Confidence Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 20, 2020
Hippolyte Bourel, Odalric-Ambrym Maillard, Mohammad Sadegh Talebi

Figure 1 for Tightening Exploration in Upper Confidence Reinforcement Learning
Figure 2 for Tightening Exploration in Upper Confidence Reinforcement Learning
Figure 3 for Tightening Exploration in Upper Confidence Reinforcement Learning
Figure 4 for Tightening Exploration in Upper Confidence Reinforcement Learning
Viaarxiv icon

Model-Based Reinforcement Learning Exploiting State-Action Equivalence

Add code
Bookmark button
Alert button
Oct 09, 2019
Mahsa Asadi, Mohammad Sadegh Talebi, Hippolyte Bourel, Odalric-Ambrym Maillard

Figure 1 for Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Figure 2 for Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Figure 3 for Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Figure 4 for Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Viaarxiv icon

Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs

Add code
Bookmark button
Alert button
Mar 05, 2018
Mohammad Sadegh Talebi, Odalric-Ambrym Maillard

Figure 1 for Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs
Figure 2 for Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs
Figure 3 for Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs
Figure 4 for Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs
Viaarxiv icon