Csaba Szepesvari

Model-Based Reinforcement Learning with Value-Targeted Regression

Jun 01, 2020
Via arXiv

On the Global Convergence Rates of Softmax Policy Gradient Methods

May 13, 2020
Via arXiv

Provably Efficient Adaptive Approximate Policy Iteration

Mar 15, 2020
Via arXiv

Model Selection in Contextual Stochastic Bandit Problems

Mar 03, 2020
Via arXiv

Differentiable Bandit Exploration

Feb 17, 2020
Via arXiv

Learning with Good Feature Representations in Bandits and in RL with a Generative Model

Nov 18, 2019
Via arXiv

Autonomous exploration for navigating in non-stationary CMPs

Oct 18, 2019
Via arXiv

Adaptive Exploration in Linear Contextual Bandit

Oct 15, 2019
Via arXiv

PAC-Bayes with Backprop

Oct 04, 2019
Via arXiv

Exploration-Enhanced POLITEX

Aug 27, 2019
Via arXiv