Picture for Csaba Szepesvari

Csaba Szepesvari

Dj

Stochastic Rank-1 Bandits

Add code
Mar 08, 2017
Figure 1 for Stochastic Rank-1 Bandits
Figure 2 for Stochastic Rank-1 Bandits
Figure 3 for Stochastic Rank-1 Bandits
Viaarxiv icon

Sequential Learning without Feedback

Add code
Oct 18, 2016
Figure 1 for Sequential Learning without Feedback
Figure 2 for Sequential Learning without Feedback
Figure 3 for Sequential Learning without Feedback
Figure 4 for Sequential Learning without Feedback
Viaarxiv icon

The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits

Add code
Oct 14, 2016
Viaarxiv icon

Learning with a Strong Adversary

Add code
Jan 16, 2016
Figure 1 for Learning with a Strong Adversary
Figure 2 for Learning with a Strong Adversary
Figure 3 for Learning with a Strong Adversary
Figure 4 for Learning with a Strong Adversary
Viaarxiv icon

Combinatorial Cascading Bandits

Add code
Nov 17, 2015
Figure 1 for Combinatorial Cascading Bandits
Figure 2 for Combinatorial Cascading Bandits
Figure 3 for Combinatorial Cascading Bandits
Viaarxiv icon

Cascading Bandits: Learning to Rank in the Cascade Model

Add code
May 18, 2015
Figure 1 for Cascading Bandits: Learning to Rank in the Cascade Model
Figure 2 for Cascading Bandits: Learning to Rank in the Cascade Model
Figure 3 for Cascading Bandits: Learning to Rank in the Cascade Model
Viaarxiv icon

Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits

Add code
Jan 27, 2015
Figure 1 for Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits
Viaarxiv icon

On Minimax Optimal Offline Policy Evaluation

Add code
Sep 12, 2014
Figure 1 for On Minimax Optimal Offline Policy Evaluation
Viaarxiv icon

Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm

Add code
Jun 16, 2014
Figure 1 for Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm
Figure 2 for Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm
Figure 3 for Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm
Figure 4 for Bayesian Optimal Control of Smoothly Parameterized Systems: The Lazy Posterior Sampling Algorithm
Viaarxiv icon

Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions

Add code
Mar 12, 2013
Figure 1 for Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions
Figure 2 for Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions
Viaarxiv icon