Alert button
Picture for Csaba Szepesvari

Csaba Szepesvari

Alert button

University of Alberta

Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm

Add code
Bookmark button
Alert button
Jun 16, 2014
Yasin Abbasi-Yadkori, Csaba Szepesvari

Figure 1 for Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm
Figure 2 for Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm
Figure 3 for Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm
Figure 4 for Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm
Viaarxiv icon

Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions

Add code
Bookmark button
Alert button
Mar 12, 2013
Yasin Abbasi-Yadkori, Peter L. Bartlett, Csaba Szepesvari

Figure 1 for Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions
Figure 2 for Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions
Viaarxiv icon

Statistical Linear Estimation with Penalized Estimators: an Application to Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 27, 2012
Bernardo Avila Pires, Csaba Szepesvari

Viaarxiv icon

An Adaptive Algorithm for Finite Stochastic Partial Monitoring

Add code
Bookmark button
Alert button
Jun 27, 2012
Gabor Bartok, Navid Zolghadr, Csaba Szepesvari

Figure 1 for An Adaptive Algorithm for Finite Stochastic Partial Monitoring
Figure 2 for An Adaptive Algorithm for Finite Stochastic Partial Monitoring
Figure 3 for An Adaptive Algorithm for Finite Stochastic Partial Monitoring
Figure 4 for An Adaptive Algorithm for Finite Stochastic Partial Monitoring
Viaarxiv icon

Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods

Add code
Bookmark button
Alert button
Jun 20, 2012
Gergely Neu, Csaba Szepesvari

Figure 1 for Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods
Figure 2 for Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods
Figure 3 for Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods
Figure 4 for Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods
Viaarxiv icon

Analysis of Kernel Mean Matching under Covariate Shift

Add code
Bookmark button
Alert button
Jun 18, 2012
Yaoliang Yu, Csaba Szepesvari

Viaarxiv icon

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

Add code
Bookmark button
Alert button
Jun 13, 2012
Richard S. Sutton, Csaba Szepesvari, Alborz Geramifard, Michael P. Bowling

Figure 1 for Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
Figure 2 for Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
Figure 3 for Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
Viaarxiv icon

Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions

Add code
Bookmark button
Alert button
Jun 13, 2012
Alejandro Isaza, Csaba Szepesvari, Vadim Bulitko, Russell Greiner

Figure 1 for Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions
Figure 2 for Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions
Figure 3 for Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions
Figure 4 for Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions
Viaarxiv icon

PAC-Bayesian Policy Evaluation for Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 14, 2012
Mahdi MIlani Fard, Joelle Pineau, Csaba Szepesvari

Figure 1 for PAC-Bayesian Policy Evaluation for Reinforcement Learning
Figure 2 for PAC-Bayesian Policy Evaluation for Reinforcement Learning
Figure 3 for PAC-Bayesian Policy Evaluation for Reinforcement Learning
Viaarxiv icon

Alignment Based Kernel Learning with a Continuous Set of Base Kernels

Add code
Bookmark button
Alert button
Dec 20, 2011
Arash Afkanpour, Csaba Szepesvari, Michael Bowling

Figure 1 for Alignment Based Kernel Learning with a Continuous Set of Base Kernels
Figure 2 for Alignment Based Kernel Learning with a Continuous Set of Base Kernels
Figure 3 for Alignment Based Kernel Learning with a Continuous Set of Base Kernels
Figure 4 for Alignment Based Kernel Learning with a Continuous Set of Base Kernels
Viaarxiv icon