Picture for Michal Valko

Michal Valko

Sid

A Provably Efficient Sample Collection Strategy for Reinforcement Learning

Add code
Jul 13, 2020
Figure 1 for A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Figure 2 for A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Figure 3 for A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Figure 4 for A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Viaarxiv icon

A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces

Add code
Jul 09, 2020
Figure 1 for A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces
Figure 2 for A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces
Figure 3 for A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces
Figure 4 for A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces
Viaarxiv icon

Gamification of Pure Exploration for Linear Bandits

Add code
Jul 02, 2020
Figure 1 for Gamification of Pure Exploration for Linear Bandits
Figure 2 for Gamification of Pure Exploration for Linear Bandits
Figure 3 for Gamification of Pure Exploration for Linear Bandits
Figure 4 for Gamification of Pure Exploration for Linear Bandits
Viaarxiv icon

Sampling from a $k$-DPP without looking at all items

Add code
Jun 30, 2020
Figure 1 for Sampling from a $k$-DPP without looking at all items
Figure 2 for Sampling from a $k$-DPP without looking at all items
Figure 3 for Sampling from a $k$-DPP without looking at all items
Viaarxiv icon

Stochastic bandits with arm-dependent delays

Add code
Jun 18, 2020
Figure 1 for Stochastic bandits with arm-dependent delays
Figure 2 for Stochastic bandits with arm-dependent delays
Figure 3 for Stochastic bandits with arm-dependent delays
Figure 4 for Stochastic bandits with arm-dependent delays
Viaarxiv icon

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Add code
Jun 13, 2020
Figure 1 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 2 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 3 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 4 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Viaarxiv icon

Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits

Add code
Jun 11, 2020
Figure 1 for Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits
Figure 2 for Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits
Figure 3 for Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits
Figure 4 for Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits
Viaarxiv icon

Adaptive Reward-Free Exploration

Add code
Jun 11, 2020
Figure 1 for Adaptive Reward-Free Exploration
Figure 2 for Adaptive Reward-Free Exploration
Figure 3 for Adaptive Reward-Free Exploration
Viaarxiv icon

Planning in Markov Decision Processes with Gap-Dependent Sample Complexity

Add code
Jun 10, 2020
Figure 1 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 2 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 3 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 4 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Viaarxiv icon

Regret Bounds for Kernel-Based Reinforcement Learning

Add code
Apr 12, 2020
Figure 1 for Regret Bounds for Kernel-Based Reinforcement Learning
Figure 2 for Regret Bounds for Kernel-Based Reinforcement Learning
Figure 3 for Regret Bounds for Kernel-Based Reinforcement Learning
Viaarxiv icon