Csaba Szepesvari

Statistical Linear Estimation with Penalized Estimators: an Application to Reinforcement Learning

Jun 27, 2012

An Adaptive Algorithm for Finite Stochastic Partial Monitoring

Jun 27, 2012

Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods

Jun 20, 2012

Analysis of Kernel Mean Matching under Covariate Shift

Jun 18, 2012

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

Jun 13, 2012

Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions

Jun 13, 2012

PAC-Bayesian Policy Evaluation for Reinforcement Learning

Feb 14, 2012

Alignment Based Kernel Learning with a Continuous Set of Base Kernels

Dec 20, 2011

X-Armed Bandits

Apr 13, 2011

Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems

Feb 14, 2011