Picture for Michal Valko

Michal Valko

Sid

Regret Bounds for Kernel-Based Reinforcement Learning

Add code
Apr 12, 2020
Figure 1 for Regret Bounds for Kernel-Based Reinforcement Learning
Figure 2 for Regret Bounds for Kernel-Based Reinforcement Learning
Figure 3 for Regret Bounds for Kernel-Based Reinforcement Learning
Viaarxiv icon

Taylor Expansion Policy Optimization

Add code
Mar 13, 2020
Figure 1 for Taylor Expansion Policy Optimization
Figure 2 for Taylor Expansion Policy Optimization
Figure 3 for Taylor Expansion Policy Optimization
Figure 4 for Taylor Expansion Policy Optimization
Viaarxiv icon

Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification

Add code
Feb 26, 2020
Figure 1 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 2 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 3 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 4 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Viaarxiv icon

No-Regret Exploration in Goal-Oriented Reinforcement Learning

Add code
Jan 30, 2020
Figure 1 for No-Regret Exploration in Goal-Oriented Reinforcement Learning
Figure 2 for No-Regret Exploration in Goal-Oriented Reinforcement Learning
Figure 3 for No-Regret Exploration in Goal-Oriented Reinforcement Learning
Figure 4 for No-Regret Exploration in Goal-Oriented Reinforcement Learning
Viaarxiv icon

Multiagent Evaluation under Incomplete Information

Add code
Oct 30, 2019
Figure 1 for Multiagent Evaluation under Incomplete Information
Figure 2 for Multiagent Evaluation under Incomplete Information
Figure 3 for Multiagent Evaluation under Incomplete Information
Figure 4 for Multiagent Evaluation under Incomplete Information
Viaarxiv icon

Fixed-Confidence Guarantees for Bayesian Best-Arm Identification

Add code
Oct 28, 2019
Figure 1 for Fixed-Confidence Guarantees for Bayesian Best-Arm Identification
Figure 2 for Fixed-Confidence Guarantees for Bayesian Best-Arm Identification
Viaarxiv icon

Derivative-Free & Order-Robust Optimisation

Add code
Oct 22, 2019
Figure 1 for Derivative-Free & Order-Robust Optimisation
Figure 2 for Derivative-Free & Order-Robust Optimisation
Figure 3 for Derivative-Free & Order-Robust Optimisation
Viaarxiv icon

Exact sampling of determinantal point processes with sublinear time preprocessing

Add code
May 31, 2019
Figure 1 for Exact sampling of determinantal point processes with sublinear time preprocessing
Figure 2 for Exact sampling of determinantal point processes with sublinear time preprocessing
Figure 3 for Exact sampling of determinantal point processes with sublinear time preprocessing
Viaarxiv icon

Gaussian Process Optimization with Adaptive Sketching: Scalable and No Regret

Add code
Mar 13, 2019
Figure 1 for Gaussian Process Optimization with Adaptive Sketching: Scalable and No Regret
Viaarxiv icon

Exploiting Structure of Uncertainty for Efficient Combinatorial Semi-Bandits

Add code
Feb 11, 2019
Figure 1 for Exploiting Structure of Uncertainty for Efficient Combinatorial Semi-Bandits
Figure 2 for Exploiting Structure of Uncertainty for Efficient Combinatorial Semi-Bandits
Figure 3 for Exploiting Structure of Uncertainty for Efficient Combinatorial Semi-Bandits
Figure 4 for Exploiting Structure of Uncertainty for Efficient Combinatorial Semi-Bandits
Viaarxiv icon