Csaba Szepesvari

Exploration by Optimisation in Partial Monitoring

Jul 24, 2019

Randomized Exploration in Generalized Linear Bandits

Jun 21, 2019

Gradient Descent for Sparse Rank-One Matrix Completion for Crowd-Sourced Aggregation of Sparsely Interacting Workers

Apr 25, 2019

Empirical Bayes Regret Minimization

Apr 04, 2019

Perturbed-History Exploration in Stochastic Linear Bandits

Mar 21, 2019

An Exponential Efron-Stein Inequality for Lq Stable Learning Rules

Mar 12, 2019

Perturbed-History Exploration in Stochastic Multi-Armed Bandits

Feb 26, 2019

An Information-Theoretic Approach to Minimax Regret in Partial Monitoring

Feb 01, 2019

Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures

Dec 04, 2018

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits

Nov 13, 2018